INTRODUCTION


In this, the final report of our progress during the four-year funding period, I
summarize our accomplishments. During this period we sought tumor suppressor genes
that are involved in breast cancer. Our method was to search for homozygously
deleted loci, and then to search for genes within these loci.

BODY 


We applied RDA, a molecular tool we developed for difference analysis (3,4), to
human breast cancer. We assembled a collection of more than 500 biopsies and sorted
their nuclei into aneuploid and diploid fractions; this is probably the largest
collection of its type in the world. Using the aneuploid DNA as driver and diploid
DNA as tester, we derived over 100 probes and analysed these for homozygous deletion
in this collection. A total of six new loci were thereby discovered and confirmed.

One  of  these,  on  chromosome  10,  was  among  the  first  to  be  analysed  and  led  to 
the  discovery  of  the  PTEN  tumor  suppressor  gene  (2).  PTEN  is  mutated  in  many 
glioblastomas,  prostate  and  breast  cancers  (6,9).  Germline  mutation  of  PTEN  occurs  in 
patients  with  Cowden's  disease,  an  inherited  condition  that  predisposes  to  neoplasms. 
Most recently, germline knockouts of the PTEN gene in transgenic mice have led to
direct confirmation that PTEN is a tumor suppressor.

PTEN  encodes  a  protein  phosphatase  (8)  that  also  possesses  phospholipid 
phosphatase activity (7). In particular, PTEN can remove the 3' phosphate from PIP3.
Current  evidence  suggests  that  the  lipid  phosphatase  activity  is  directly  involved  in 
PTEN's  role  as  a  tumor  suppressor. 

The remaining loci reside on chromosomes 4, 8 (two loci), 21, and 6. We have made
progress defining the common boundaries of the homozygously deleted regions, a
prerequisite for gene hunting. The region on chromosome 4 appears to be contained
within a few hundred kilobases and those on 8 and 21 within a megabase; we have just
begun to characterize the one on 6. To conduct these studies we adopted one new
technology (quantitative PCR) and invented another, namely the immortalizing of
small samples of tumor DNA by preparing high complexity representations (HCR).
Wedding the two together, we can now accurately measure gene copy number in minute
biopsy samples (5). We expect that this technique may be widely used by cancer
biologists in the molecular assessment of tumors.

We also developed another tool for gene searching. Genetic analysis such as ours
often defines a large region of the genome that may contain a gene of particular
interest. Searching for the gene in this large region is often a rate-limiting step
in the research. Three basic techniques are employed: DNA sequence analysis, exon
trapping, and hybrid selection. Each method has advantages and pitfalls. We have
developed a new method based on hybrid selection, called RICH (1), which adds to the
armamentarium of search tools. RICH finds genes within a region that overlap in
sequence with collections of cDNAs.


CONCLUSION 

In summary, we successfully applied a new tool (RDA) to the search for tumor
suppressor genes in breast cancer, identified one such gene (PTEN), defined the
function of its protein product, found and began to characterize several other
regions that harbor candidate tumor suppressor genes, and invented two new molecular
tools: one for analyzing the genomes of clinical cancer specimens, and one for the
accelerated discovery of genes in large genomic regions.

Although  RDA  using  tumor  as  driver  and  normal  as  tester  is  an  effective  way  to 
find  probes  deleted  in  the  tumor,  these  deletions  are  often  large,  and  the  search  for 
genes  within  the  deleted  region  is  very  time  consuming.  We  have  therefore  begun  new 
protocols,  using  tumor  as  tester  and  normal  as  driver  to  search  for  rearrangements, 
which  may  lead  to  the  more  rapid  discovery  of  target  genes. 

Detection of Genetic Loss in Tumors by
Representational Difference Analysis

N.A. Lisitsyn,* F.S. Leach,† B. Vogelstein,† and M.H. Wigler*

*Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724;
†The Johns Hopkins Oncology Center, Baltimore, Maryland 21231


A variety of genetic lesions are found in tumors, including DNA losses, point
mutations, gene amplifications, and rearrangements (Lasko et al. 1991; Salomon
et al. 1991). Frequent losses of both alleles at a given locus or losses of one
allele with functional inactivation of the other have been detected in many tumor
types. These genetic lesions, manifesting themselves as loss of heterozygosity
(LOH) and hemizygous and homozygous deletions, have been found to be the hallmarks
of the presence of tumor suppressor genes. Many approaches have been taken in the
past to identify these genes, but recently we have developed a new method that is
both general and efficient (Lisitsyn et al. 1993, 1995). The method, called
representational difference analysis, or RDA, is designed for analyzing the
differences between complex but highly related genomes and combines three elements:
representation, subtractive enrichment, and kinetic enrichment. The first stage of
the procedure comprises the preparation of representations from the genomes, during
which DNAs are cut with a restriction endonuclease, ligated to oligonucleotide
adapters, and amplified by the polymerase chain reaction (PCR). Since only small
fragments (<1 kb in length), called ARFs, are efficiently amplified by standard PCR
procedures, representations have at least tenfold lower complexity than initial
DNAs. This enormously increases the efficiency of the second stage, comprising the
reiterative hybridization/selection steps during which ARFs present in one sample,
the tester, but not in the other, the driver, are selectively enriched.
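
The representation and subtraction stages can be illustrated with a toy in-silico sketch. It rests on several assumptions that are not part of the published protocol: BglII (recognition site AGATCT, cut after the first A) is taken as the restriction enzyme, adapters, PCR, and hybridization kinetics are ignored, and enrichment is reduced to a plain set difference. All function names and sequences are hypothetical.

```python
import re

BGLII_SITE = "AGATCT"  # BglII recognition sequence; the enzyme cuts A^GATCT


def internal_fragments(genome: str, site: str = BGLII_SITE) -> list[str]:
    """Return fragments bounded by a cut on both sides (end pieces are dropped)."""
    cuts = [m.start() + 1 for m in re.finditer(site, genome)]
    return [genome[a:b] for a, b in zip(cuts, cuts[1:])]


def representation(genome: str, max_len: int = 1000) -> set[str]:
    """Keep only fragments assumed short enough to amplify (< max_len bp)."""
    return {f for f in internal_fragments(genome) if len(f) < max_len}


def difference_products(tester: str, driver: str, max_len: int = 1000) -> set[str]:
    """Fragments represented in the tester but absent from the driver."""
    return representation(tester, max_len) - representation(driver, max_len)


# Toy sequences (hypothetical): the tumor has lost one BglII fragment.
NORMAL = "CCCC" + "AGATCT" + "GGGTTT" + "AGATCT" + "ACGTACGT" + "AGATCT" + "CC"
TUMOR = "CCCC" + "AGATCT" + "GGGTTT" + "AGATCT" + "CC"

if __name__ == "__main__":
    # Normal DNA as tester and tumor DNA as driver recovers the lost fragment.
    print(difference_products(NORMAL, TUMOR))
```

Swapping the arguments (tumor as tester, normal as driver) returns an empty set for these toy sequences, which mirrors why the choice of tester and driver determines whether losses or gains are recovered.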

We  describe  here  the  application  of  RDA  to  discover 
sequences  that  are  lost  in  tumors. 

CLONING  SEQUENCES  LOST  IN  TUMORS 

We performed RDA on 16 individual pairs of tumor DNA (used to derive driver) and
matched normal DNA (used to derive tester) from the same patient. Fifteen of the
tumor DNAs were isolated from tumor cell lines (9 renal cell and 6 colon cancer
cell lines), with normal DNAs derived from unaffected blood or tissue. In one case,
we used a fluorescence-activated cell sorter to fractionate nuclei from an
esophageal cancer biopsy into aneuploid and diploid fractions used for preparation
of driver and tester DNA, respectively.

In each application of RDA, difference products were cloned and analyzed by blot
hybridization. The “informative” probes hybridized to DNA from the normal
representation but not the tumor representation. Some of these probes mapped to the
Y chromosome. Loss of Y chromosome information is frequently observed in renal cell
carcinomas (Presti et al. 1991). Other probes detected binary polymorphisms at BglII
sites and were presumed to reflect loss of heterozygosity in tumor. Finally, some
probes did not hybridize at all to total genomic DNA from the tumor. Probes of this
type were sequenced, and oligonucleotides were derived for use in PCR screening of
genomic DNA from the tester and driver sources, and from panels of normal human and
tumor cell lines. Occasionally, we found probes that did not hybridize to several
normal human DNAs. We presume that these probes reflect hemizygous loss in the tumor
of a deletion polymorphism common in the human population (see Table 1,
footnote c). Table 1 summarizes the types of probes we obtained (for further
details, see Lisitsyn et al. 1995).
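
The bookkeeping in Table 1 can be checked with a short script. The counts below are transcribed from Table 1 (the row for UOK 127 is read as 3 selected and 3 informative probes, split 2/1/0); the dictionary layout and function name are illustrative assumptions, not part of the original analysis.

```python
# Counts transcribed from Table 1:
# sample -> (selected, informative, LOH, hemizygous loss, homozygous loss)
TABLE_1 = {
    "UOK 112": (13, 13, 0, 13, 0),
    "UOK 114": (12, 4, 3, 0, 1),
    "UOK 124": (12, 4, 4, 0, 0),
    "UOK 132": (10, 9, 3, 6, 0),
    "UOK 108": (2, 2, 2, 0, 0),
    "UOK 111": (5, 5, 5, 0, 0),
    "UOK 127": (3, 3, 2, 1, 0),
    "UOK 146": (3, 3, 1, 1, 1),
    "UOK 154": (5, 1, 1, 0, 0),
    "VACO 429": (2, 1, 0, 0, 1),
    "VACO 441": (3, 3, 1, 0, 2),
    "VACO 432": (2, 1, 1, 0, 0),
    "VACO 456": (2, 1, 1, 0, 0),
    "VACO 576": (2, 2, 2, 0, 0),
    "RBX": (2, 1, 1, 0, 0),
    "BE 758": (5, 5, 0, 4, 1),
}

# Reasons listed in the table footnote for probes that were not informative.
EXCLUDED = {
    "repeat sequences": 9,
    "nonhuman DNA contaminating tester": 5,
    "single-copy sequences present in tester and driver": 11,
}


def column_totals(table: dict) -> tuple:
    """Sum each column across all samples."""
    return tuple(sum(col) for col in zip(*table.values()))


if __name__ == "__main__":
    print(column_totals(TABLE_1))
```

The column sums reproduce the "Total" row, and the gap between selected and informative probes equals the number of exclusions listed in the footnote.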

HOMOZYGOUS  LOSS  ON  CHROMOSOME  3p 

From 16 comparisons, 6 pairs yielded probes that appear to detect homozygous loss in
the tumor used as driver. Three probes were found to be homozygously deleted in
other DNAs isolated from a collection of 100 tumor cell lines established from
different types of cancer but not from normals (N. Lisitsyn and R. Lucito,
unpubl.). Probe 758-6, derived from a patient with Barrett’s esophagus, detected
frequent losses in cancers of the digestive tract. Loss detected with this probe has
also been observed in breast, bladder, and lung tumors. This probe has been analyzed
in greatest detail.

Probe 758-6 was mapped to chromosomal region 3p by PCR analysis of monochromosomal
human/rodent cell hybrids. The probe was used for screening a chromosome-3 cosmid
library, and a cosmid contig was built by chromosome walking (see Fig. 1).
Single-copy probes were derived from this contig and used to screen DNAs from a
collection of human colon cancer cell lines and xenografts. Of 175 tumor DNAs, 20
(11%) lacked sequences from at least one of the probes from this region. In
contrast, losses were observed in 1 of 122 lung cancer cell lines (S. Bader and
J. Minna, unpubl.). Figure 1 shows the different patterns of loss that were observed
in colorectal cancer cell lines and xenografts.

Table 1. Analysis of RDA Probes Derived Using Tumor DNA as Driver

                                   Selected for initial   Found to be
                                   characterization       informativeᵃ

Renal cell carcinoma cell lines
  UOK 112 (male)                   13ᵇ                    13 (0/13/0)
  UOK 114 (female)                 12ᵇ                    4 (3/0/1)
  UOK 124 (female)                 12ᵇ                    4 (4/0/0)
  UOK 132 (male)                   10ᵇ                    9 (3/6/0)
  UOK 108 (female)                 2                      2 (2/0/0)
  UOK 111 (female)                 5                      5 (5/0/0)
  UOK 127 (male)                   3                      3 (2/1ᶜ/0)
  UOK 146 (female)                 3                      3 (1/1ᶜ/1)
  UOK 154 (female)                 5                      1 (1/0/0)
Colon cancer cell lines
  VACO 429 (male)                  2                      1 (0/0/1)
  VACO 441 (female)                3                      3 (1/0/2)
  VACO 432 (male)                  2                      1 (1/0/0)
  VACO 456 (female)                2                      1 (1/0/0)
  VACO 576 (female)                2                      2 (2/0/0)
  RBX (male)                       2                      1 (1/0/0)
Barrett’s esophagus
  BE 758 (male)ᵈ                   5                      5 (0/4/1ᵉ)

Total:                             83                     58 (27/25/6)

ᵃ Entries are a (b/c/d), where a is the total number of probes detecting DNA loss
in tumors, judged to be: b, loss of heterozygosity; c, hemizygous loss; d,
presumably homozygous loss (see Discussion). All but two probes judged to detect
hemizygous loss were derived from the Y chromosome. The difference between
quantities of initially selected probes (83) and informative probes (58) was due to
the presence of repeat sequences (9 cases), nonhuman DNA contaminating tester
(5 cases), and single-copy sequences present in both tester and driver DNAs
(11 cases).

ᵇ The difference products after two rounds of hybridization/selection were cloned;
in all the rest of the experiments cloning was performed after three rounds.

ᶜ Probes 127-1 and 146-1 were found to be deletion polymorphisms, absent on both
autosomes of 7 out of 35 and 3 out of 35 normal humans, respectively.

ᵈ Nuclei from a biopsy were sorted by flow cytometry into aneuploid (tumor) and
diploid (normal) fractions.

ᵉ This result is presumed, but was not confirmed because of the small amount of
sorted tumor nuclei available.

Additional probes derived from a cosmid contig were used in hybridizations to
Southern blots containing DNAs harboring deletions. Several hybridizing restriction
fragments were observed in tumors that were absent in normals, presumably as a
result of rearrangements occurring at the ends of some of the deletions.

This  observation  rules  out  the  possibility  that  probe 
758-6  detects  a  deletion  polymorphism  and  that  the  loss 
of  sequences  is  caused  by  the  same  types  of  mechanisms 
that  underlie  loss  of  heterozygosity  at  polymorphic 
markers.  Work  is  in  progress  to  identify  transcribed 
sequences  from  this  region. 

DISCUSSION 

The RDA methodology may be successfully applied to detection of DNA losses in
tumors, readily providing probes that detect homozygous deletions. As we were able
to demonstrate, some of these deletions are relatively small (<50 kb) and, thus,
positional cloning of genes that must be inactivated in tumors becomes much more
efficient, as compared to other techniques used for this purpose (e.g.,
allelotyping, linkage analysis of predispositions in families, and cytogenetic
studies followed by microdissection). Since some of these probes are deleted in more
than one DNA isolated from tumor cells, it is possible that the deleted locus
contains a gene that is commonly inactivated in tumors.

It is well documented that some genes known to be disrupted by homozygous deletions
in tumors regulate cellular growth, differentiation, and genomic stability. Some of
these genes have strong tumor suppressor phenotypes after transfection into tumor
cell lines. Our screening technique based on RDA methodology holds promise for the
identification of new genes participating in these or other processes. We have taken
a similar approach for the cloning of dominant oncogenes. These are frequently
amplified in tumors, and probes detecting amplifications can be efficiently cloned
by RDA when tumor DNA is used to derive tester (Lisitsyn et al. 1995). Although
amplified regions are usually rather large, one can map candidate oncogenes more
precisely by finding the minimal region common to all amplifications at a given
locus. The use of RDA for the analysis of cancers thus opens up new avenues for
understanding the etiology of the disease and for the development of prognostic and
diagnostic markers.
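
Finding the minimal region common to all amplifications (or deletions) at a locus amounts to intersecting intervals. The following is a minimal sketch under the assumption that each lesion has been mapped to a single contiguous interval on a shared coordinate system; the function name and the coordinates are hypothetical.

```python
from typing import Optional


def minimal_common_region(
    intervals: list[tuple[int, int]],
) -> Optional[tuple[int, int]]:
    """Intersect (start, end) intervals; return None if they share no region."""
    if not intervals:
        return None
    start = max(s for s, _ in intervals)  # rightmost left boundary
    end = min(e for _, e in intervals)    # leftmost right boundary
    return (start, end) if start < end else None


if __name__ == "__main__":
    # Hypothetical amplified regions (kb along one chromosome) in three tumors.
    amplicons = [(100, 900), (300, 1200), (250, 700)]
    print(minimal_common_region(amplicons))
```

Each additional tumor can only shrink the common region, which is why large panels help narrow the search for a candidate gene.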

Figure 1. Physical map of a chromosome 3p region, showing deletions in colorectal
tumor cell lines in the 758-6 region. On top are shown cosmids from a region
isolated by chromosomal walking. Patterns of the homozygous loss of STSs (thick
bars) detected by PCR analysis are depicted on the bottom. Pluses and minuses
indicate presence or absence of the probe in DNAs from colorectal cancer cells.

Mapping of homozygous deletions on human chromosome 10q23 has led to the isolation
of a candidate tumor suppressor gene, PTEN, that appears to be mutated at
considerable frequency in human cancers. In preliminary screens, mutations of PTEN
were detected in 31% (13/42) of glioblastoma cell lines and xenografts, 100% (4/4)
of prostate cancer cell lines, 6% (4/65) of breast cancer cell lines and xenografts,
and 17% (3/18) of primary glioblastomas. The predicted PTEN product has a protein
tyrosine phosphatase domain and extensive homology to tensin, a protein that
interacts with actin filaments at focal adhesions. These homologies suggest that
PTEN may suppress tumor cell growth by antagonizing protein tyrosine kinases and may
regulate tumor cell invasion and metastasis through interactions at focal adhesions.


As tumors progress to more advanced stages, they acquire an increasing number of
genetic alterations. One alteration that occurs at high frequency in a variety of
human tumors is loss of heterozygosity (LOH) at chromosome 10q23. This change
appears to occur late in tumor development: although rarely seen in low-grade glial
tumors and early-stage prostate cancers, LOH at 10q23 occurs in ~70% of
glioblastomas (the most advanced form of glial tumor) and ~60% of advanced prostate
cancers (1, 2). This pattern of LOH, and the recent finding that wild-type
chromosome 10 suppresses the tumorigenicity of glioblastoma cells in mice, suggest
that 10q23 encodes a tumor suppressor gene (3).

To identify this putative tumor suppressor gene, we performed representational
difference analysis (RDA) on 12 primary breast tumors (4). A probe, CY17, derived
from one of the tumors was mapped to chromosome 10q23, near markers WI-9217 and
WI-4264, on the Whitehead-MIT radiation hybrid map (5). To map the location of CY17
more precisely, we isolated three yeast artificial chromosomes (YACs) containing
CY17 that are present on the sequence tagged site (STS)-based map of the human
genome (6, 7). These YACs placed CY17 slightly centromeric to the position
determined by radiation hybridization and precisely identified its location
(Fig. 1A). Analysis of 32 primary invasive breast cancers revealed LOH in this
region in about 50% of the samples. No homozygous deletions of CY17 were detected in
a panel of 65 breast tumor samples (25 cell lines and 40 xenografts) (8), so eight
additional markers were analyzed in the 10q23 region (D10S579, D10S215,
AFMA086WG9, D10S541, AFM280WE1, WI-10275, WI-8733, WI-6971). We identified
homozygous deletions of AFMA086WG9 in two xenografts, Bx11 and Bx38 (Figs. 1B and
2A), and then screened a bacterial artificial chromosome (BAC) library with this
marker (9). Using new STSs from four independent BAC clones, we determined that the
minimal region of deletion was within BAC C (Fig. 1B) (10). Homozygous deletions of
AFMA086WG9 were also detected in two of eight glioblastoma cell lines, three of 34
glioblastoma xenografts, and two of four prostate cancer cell lines (11). One of the
glioblastoma samples, cell line A172, had the same deletion pattern as the original
breast xenografts; the deletions in the other samples were larger (Fig. 1B).
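
Deletion mapping of this kind reduces to comparing presence/absence calls for an ordered set of STS markers across samples. The sketch below assumes each sample is scored as a string of "+" (marker present) and "-" (marker homozygously deleted) in map order; the marker names M1 to M6, the sample names, and the calls are hypothetical and do not reproduce the data in Fig. 1B.

```python
def deleted_markers(markers: list[str], calls: str) -> list[str]:
    """Markers scored '-' in one sample; calls has one character per marker."""
    if len(calls) != len(markers):
        raise ValueError("one call is required per marker")
    return [m for m, c in zip(markers, calls) if c == "-"]


def minimal_deleted_region(markers: list[str], samples: dict[str, str]) -> list[str]:
    """Markers absent from every sample, reported in map order."""
    common = set(markers)
    for calls in samples.values():
        common &= set(deleted_markers(markers, calls))
    return [m for m in markers if m in common]


# Hypothetical marker order and PCR calls for three tumor samples.
MARKERS = ["M1", "M2", "M3", "M4", "M5", "M6"]
SAMPLES = {
    "tumor_A": "++--++",
    "tumor_B": "+----+",
    "tumor_C": "++---+",
}

if __name__ == "__main__":
    print(minimal_deleted_region(MARKERS, SAMPLES))
```

The sample with the smallest deletion sets the boundaries of the minimal region, so a single small deletion can be more informative than many large ones.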

To confirm the presence of homozygous deletions, we hybridized a Southern (DNA) blot
with a 3-kb probe derived from a genomic clone spanning the region of deletion (12).
Xenografts anticipated to have a homozygous deletion did not hybridize to this
probe; the control xenografts hybridized to the expected 3-kb band (Fig. 2B).

We identified genes within the 10q23 region by exon trap analysis of BACs C and D
(Fig. 1B) (13). Two trapped exons, ET-1 and ET-2, had sequences that were perfect
matches to an unmapped UNIGENE assembly of expressed sequence tags (ESTs) as well as
several unassembled ESTs (6). Clones containing the ESTs were sequenced and used to
assemble an open reading frame (ORF) of 403 amino acids (Fig. 3A). To verify the
location of this cDNA, we obtained the intronic sequence around ET-1 by directly
sequencing BAC C. An STS primer pair (ET-1) was generated that mapped back to BACs
A, B, and C (Fig. 1B). In addition, we screened the Map Panel #2 monochromosome
human-rodent hybrid panel to confirm the unique location of this exon on
chromosome 10 (14).


Our entire panel of tumor xenografts and cell lines was screened with this primer
pair, and we identified an additional glioblastoma cell line (DBTRG-05MG) with a
deletion of 180 base pairs (bp) (Figs. 1B and 2C). Sequence analysis revealed that
the deletion had removed 180 bp of exonic sequence and the splice donor site from
this 225-bp exon. This deletion was not present in 52 normal blood samples or in
more than 125 other primary tumors, xenografts, and cell lines tested.

Fig. 1. Region of homozygous deletion on chromosome 10q23. (A) The STS-based YAC map
of the region surrounding CY17. Marker locations are taken from the Whitehead
STS-based map. RH indicates the radiation hybrid interval for CY17. CY17 positive
YAC addresses are indicated. YAC map indicates the interval containing CY17 inferred
from the YAC addresses. Cen., centromere; Tel., telomere. (B) Map of homozygous
deletions on 10q23, showing the STS markers spanning the deleted region, the four
BACs overlapping the region, and the location of PTEN with respect to the STS
markers. STS markers Not-5', PTPD, and ET-1 contain exonic sequences of the PTEN
gene. Absence of homozygous deletion is indicated with a “+” and presence of
homozygous deletion with a “−”. Numbers to the right indicate the fraction of tumor
cell lines and xenografts with the deletion. The two breast cancer samples with a
deletion are xenografts Bx11 and Bx38. Glioblastoma line A172 has a deletion
encompassing markers JL25 through KP8 and glioblastoma line DBTRG-05MG has a
deletion affecting only ET-1. The glioblastoma samples with a deletion across the
entire region are the cell line U105 and xenografts 2, 3, and 11, and the samples
with deletion of only PTPD, which contains the phosphatase domain, are xenografts
22, 23, 24, 25, and 32. The prostate cancer cell lines with homozygous deletions are
NCI H660 and PC-3. The 5' end of the PTEN cDNA was determined to be coincidental
with the Not I site 20 kb from the centromeric end of BAC D by sequence analysis.
These maps are not drawn to scale.

Table 1. Summary of PTEN mutations in tumor cell lines and primary tumors.

Tumor sample   Tissue of origin   Codon     Mutation*            Predicted effect
LNCaP          Prostate           6         AAA to A             Frameshift
534T†          Glioblastoma       15        AGA to AGAGA         Frameshift
U87MG          Glioblastoma       54        49 bp deletion       Frameshift
MDA-MB-468     Breast             70        44 bp deletion       Frameshift
132T†          Glioblastoma       129       GGA to AGA           Gly to Arg
DU 145         Prostate           134       ATG to TTG           Met to Leu
U373MG         Glioblastoma       241       TTT to TTTTT         Frameshift
BT549          Breast             274       GTA AAT to TAA AT    Stop
DBTRG-05MG‡    Glioblastoma       274-342   Delete 204 bp        In-frame deletion
134T†          Glioblastoma       337       4 bp deletion        Frameshift

†Mutations in the primary tumors were not found in matched blood DNA.
‡DBTRG-05MG has a genomic deletion of 180 bp within exon ET-1, which includes the
splice donor site. Because of this deletion, the transcript contains an in-frame
deletion of codons 274 to 342.

Sequence analysis of the ORF revealed a protein tyrosine phosphatase domain
(Fig. 3B) and a large region of homology (~175 amino acids) to chicken tensin and
bovine auxilin (Fig. 3C). We therefore call the gene PTEN for Phosphatase and Tensin
homolog deleted on chromosome Ten. The phosphatase domain of the P-TEN protein
contained the critical (I/V)-H-C-X-A-G-X-X-R-(S/T)-G motif found in tyrosine and
dual-specificity phosphatases (15). The phosphatase domain exon mapped within all
four BACs and was deleted in all of the samples with homozygous deletions except for
DBTRG-05MG. These results thus placed this exon within the region of homozygous
deletion near JL25 and AFMA086WG9 (Fig. 1B). We then screened the remaining
xenografts and cell lines for additional homozygous deletions and identified five
more glioblastoma xenografts lacking this exon. These data indicate that the
phosphatase domain encoded by PTEN was targeted for mutations in tumor xenografts
and cell lines.
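
The signature motif quoted above, (I/V)-H-C-X-A-G-X-X-R-(S/T)-G, translates directly into a regular expression, with X standing for any residue. The sketch below is illustrative only: the test peptide is a short hypothetical sequence written to contain one match, not the deposited P-TEN sequence, and the function name is an assumption.

```python
import re

# (I/V)-H-C-X-A-G-X-X-R-(S/T)-G in one-letter amino acid code; "." matches any residue.
PTP_SIGNATURE = re.compile(r"[IV]HC.AG..R[ST]G")


def find_signature(protein: str) -> list[tuple[int, str]]:
    """Return (1-based start position, matched residues) for each motif hit."""
    return [(m.start() + 1, m.group()) for m in PTP_SIGNATURE.finditer(protein)]


if __name__ == "__main__":
    # Hypothetical peptide containing the motif once.
    peptide = "MKLVAAIHCKAGKGRTGVMI"
    print(find_signature(peptide))
    # Replacing the invariant cysteine removes the match.
    print(find_signature(peptide.replace("IHC", "IHS")))
```

A scan like this only flags candidate catalytic sites; the alignment in Fig. 3B remains the evidence for homology.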

The phosphatase domain of P-TEN is most related in sequence to those of CDC14, PRL-1
(phosphatase of regenerating liver), and BVP (baculovirus phosphatase) (Fig. 3B).
CDC14 and BVP are dual-specificity phosphatases that remove phosphate groups from
tyrosine as well as serine and threonine (16). These phosphatases can be
distinguished from the better characterized VH1-like enzymes by sequence differences
outside of the core conserved domain. Both PRL-1 and CDC14 are involved in cell
growth, and CDC14 appears to play a role in the initiation of DNA replication (17).
In contrast to P-TEN, these phosphatases do not have extensive homology to tensin
and auxilin. P-TEN is also homologous to the protein tyrosine phosphatase domains of
three ORFs (Y50.2, PTP-IV1, CPTPH) whose protein products have not been
characterized. Of these hypothetical proteins, only the putative yeast phosphatase
Y50.2 has significant homology to tensin. Although tensin and auxilin are not
expected to have phosphatase activity, they both contain elements of the protein
tyrosine phosphatase signature sequence (18), which suggests that they may share a
tertiary structure with these enzymes (19).

[Fig. 2; the beginning of the caption is missing] ...ments resolved on a 1% agarose
gel, and transferred to a nylon membrane. The blot was probed with a 3-kb Eco RI
fragment containing the STS marker JL25, which is within the region of homozygous
deletion (top), or to a second 2-kb Eco RI fragment from chromosome 8 (bottom).
Lane M, bacteriophage lambda Hind III marker. Other lanes contain DNA from breast
xenografts 10, 11, 19, and 38 and brain xenografts 2, 3, and 11. Breast xenografts
10 and 19 were loaded as controls and were not expected to have homozygous
deletions. (C) Homozygous deletions of exon ET-1 in glioblastoma cell lines. Genomic
DNA samples were PCR amplified using intronic primers that amplify exon ET-1. The
products were resolved on a 1.2% agarose gel and then stained with ethidium bromide.
Lane 1 contains a DNA marker. The remaining lanes contain PCR products from control
templates and seven glioblastoma cell lines: lane 2, lymphocyte DNA; lane 3, water;
lane 4, U118MG; lane 5, A172; lane 6, DBTRG-05MG; lane 7, U373; lane 8, T-98G;
lane 9, U-87MG; and lane 10, U138MG. Full-length products are present for all
templates except water, A172, and DBTRG-05MG.

Fig. 3. (A) Predicted amino acid sequence of P-TEN. The putative phosphatase domain
is underlined. The nucleotide sequence has been deposited in GenBank (accession
number U93051). Abbreviations for amino acids are A, Ala; C, Cys; D, Asp; E, Glu;
F, Phe; G, Gly; H, His; I, Ile; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln;
R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. (B) Homology of P-TEN to protein
tyrosine phosphatases. The sequence alignment was performed by ClustalW
(http://dot.imgen.bcm.tmc.edu:9331/multi-align/Options/clustalw.html). The National
Center for Biotechnology Information (NCBI) ID numbers are P53916 (Y50.2), M61194
(CDC14), A56059 (PRL1), 1246236 (PTP-IV1), 1125812 (CPTPH), and P24656 (BVP). Black
boxes indicate amino acid identities and gray boxes indicate similarities.
(C) Homology of P-TEN to chicken tensin and bovine auxilin. Alignment was performed
as in (B) over the region of highest homology. NCBI ID numbers are A54970 (tensin)
and 485269 (auxilin).

If PTEN is a tumor suppressor gene, the PTEN allele retained in tumor cells with LOH
should contain inactivating mutations. To search for such mutations, we performed a
protein truncation test on 20 breast, six glioblastoma, and two prostate tumor cell
lines (20). Two truncating mutations in PTEN were identified in the breast samples
(Table 1). BT549 cells had a 1-bp deletion of a G, leading to the formation of a
stop codon TAA (Fig. 4A), and MDA-MB-468 cells had a deletion of 44 bp at codon 70,
which resulted in a frameshift on the amino terminal side of the tyrosine
phosphatase domain. Mutations in PTEN were also identified in three of the six
glioblastoma cell lines: DBTRG-05MG cells had an in-frame deletion of 204 bp caused
by the genomically deleted exon ET-1 (Fig. 4B), U373MG had a 2-bp insertion at
codon 242, and U87MG had a frameshift at codon 54. Both of the prostate tumor cell
lines had PTEN mutations: LNCaP cells had a 2-bp deletion at codon 6, leading to a
frameshift (Fig. 4C), and DU 145 cells had a Met → Leu substitution at codon 134,
within the phosphatase domain. The latter mutation was detected by a change in the
pattern of in vitro translation initiation and was not found in >50 other alleles
tested. However, Met-134 is not required for phosphatase activity (Fig. 3B), so this
alteration could be a polymorphism. With one exception (DU 145), all of the cell
lines retained a mutant PTEN allele and lost the other allele, indicating that these
cells are null for PTEN.
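
The predicted effects of these mutations follow from simple reading-frame arithmetic: a deletion whose length is a multiple of three leaves the frame intact, any other length shifts it, and a shift can bring a stop codon into frame. The sketch below illustrates this with the deletion sizes and the BT549 change (GTA AAT to TAA AT) reported in Table 1; the function names are hypothetical and only the standard stop codons are assumed.

```python
from typing import Optional

STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def deletion_effect(n_bp: int) -> str:
    """Classify a coding-sequence deletion by its length in base pairs."""
    return "in-frame" if n_bp % 3 == 0 else "frameshift"


def first_stop(seq: str) -> Optional[int]:
    """Index (0-based) of the first in-frame stop codon, or None if absent."""
    for i in range(0, len(seq) - 2, 3):
        if seq[i:i + 3] in STOP_CODONS:
            return i // 3
    return None


if __name__ == "__main__":
    for size in (204, 49, 44, 4):
        print(size, deletion_effect(size))
    wild_type = "GTAAAT"     # codons GTA AAT
    mutant = wild_type[1:]   # 1-bp deletion of the G gives TAA AT
    print(first_stop(wild_type), first_stop(mutant))
```

The 204-bp deletion in the DBTRG-05MG transcript is the only one of the listed deletions divisible by three, which is consistent with its classification as in-frame.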

To determine whether PTEN mutations are present in primary tumors, we screened
genomic DNA from 18 primary glioblastomas for mutations in three exons (21).
Mutations in PTEN were found in three of these tumors: a 2-bp insertion at codon 15
(534T), a point mutation resulting in a Gly → Arg change at codon 129 (132T), and a
4-bp frameshift mutation at codon 337 (134T) (Table 1 and Fig. 4D). The mutation at
codon 129 is within the signature sequence for tyrosine phosphatases (Fig. 3B). All
three tumors appeared to have LOH in the PTEN region since the wild-type allele was
substantially reduced in intensity. In addition, the tumor mutations were not
detected in paired blood DNA.

In summary, we detected homozygous deletions, frameshift, or nonsense mutations in
PTEN in 63% (5/8) of glioblastoma cell lines, 100% (4/4) of prostate cancer cell
lines, and 10% (2/20) of breast cancer cell lines. These frequencies are likely to
be underestimates since the cell lines were not systematically screened for point
mutations. We screened xenografts only for homozygous deletions in PTEN and detected
them in 24% (8/34) of glioblastoma xenografts and 5% (2/40) of breast cancer
xenografts. Finally, we detected PTEN mutations in 17% (3/18) of primary
glioblastomas; this frequency is also likely to be an underestimate since the entire
coding sequence was not analyzed. The results of these preliminary screens suggest
that a large fraction of glioblastomas and advanced prostate cancers may harbor PTEN
mutations, whereas the mutation frequency in breast cancer may be lower. Future
systematic analysis of all tumor types will be of interest.
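
The percentages quoted in the summary can be reproduced from the raw counts. The short Python sketch below is an illustration only: the dictionary labels and the helper name `percent_half_up` are our own, and the rounding rule (halves rounded up) is an assumption chosen because it matches the 63% reported for 5/8.

```python
import math

# (samples with a PTEN lesion, samples screened), as quoted in the summary paragraph.
counts = {
    "glioblastoma cell lines": (5, 8),
    "prostate cancer cell lines": (4, 4),
    "breast cancer cell lines": (2, 20),
    "glioblastoma xenografts": (8, 34),
    "breast cancer xenografts": (2, 40),
    "primary glioblastomas": (3, 18),
}

def percent_half_up(hits, total):
    """Whole-number percentage with halves rounded up (assumed rounding rule)."""
    return math.floor(100 * hits / total + 0.5)

frequencies = {name: percent_half_up(h, n) for name, (h, n) in counts.items()}

for name, pct in frequencies.items():
    hits, total = counts[name]
    print(f"{name}: {hits}/{total} = {pct}%")
```

Python's built-in `round` uses round-half-to-even and would give 62 for 5/8, which is why the sketch rounds explicitly.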

The likely function of the P-TEN tumor suppressor as an enzyme that removes
phosphate from tyrosines is intriguing, given that many oncoproteins function in the
reverse process, the phosphorylation of tyrosines (22). P-TEN and tyrosine kinase
oncoproteins may share substrates, and the tight control of these substrates through
phosphorylation is likely to regulate a critical pathway that is altered late in
tumor development. The homology of P-TEN to tensin is also of interest. Tensin
appears to bind actin filaments at focal adhesions, complexes that contain
integrins, focal adhesion kinase (FAK), Src, and growth factor receptors (23).
Integrins have been implicated in cell growth regulation (24) and in tumor cell
invasion, angiogenesis, and metastasis (25), so it is conceivable that PTEN
regulates one or more of these processes. Finally, the identification of P-TEN as a
likely tumor suppressor raises the possibility that this protein and its substrates
will be useful targets for the development of new therapeutics for cancer.

Fig. 4. Mutations of PTEN in cancer cell lines and primary tumors. (A) Mutation in
breast cancer cell line BT549. Sequence of nucleotides 831 to 785 (bottom to top)
using an antisense primer shows a deletion of a C (arrow) in the sample on the right
(BT549) but not in the sample on the left (breast cancer cell line ZR-75-30).
(B) Mutation in glioblastoma cell line DBTRG-05MG. Sequence of nucleotides 1039 to
1010 in the antisense orientation from prostate cancer cell lines DU145 (left),
LNCaP (middle), and the glioblastoma cell line DBTRG-05MG (right). Arrow indicates
the in-frame deletion of nucleotides 822 to 1025 in DBTRG-05MG. (C) Mutation in
prostate cancer cell line LNCaP. Sequence of nucleotides 34 to 2 of the prostate
cancer cell line LNCaP (left) and the glioblastoma cell line DBTRG-05MG (right)
using an antisense primer. Arrow indicates the deletion of two T nucleotides in
LNCaP. (D) Mutation in primary glioblastoma 534. Sequence of nucleotides 26 to 63 of
genomic DNA from blood (left) and primary tumor 534 (right) from the same patient
using a sense primer. Arrow indicates insertion of AG in the tumor DNA. A, C, G, and
T lanes are loaded next to each other to allow better detection of mutations.
ABSTRACT Protein tyrosine phosphatases (PTPs) have long been thought to play a role
in tumor suppression due to their ability to antagonize the growth-promoting protein
tyrosine kinases. Recently, a candidate tumor suppressor from 10q23, termed P-TEN,
was isolated, and sequence homology was demonstrated with members of the PTP family,
as well as the cytoskeletal protein tensin. Here we show that recombinant P-TEN
dephosphorylated protein and peptide substrates phosphorylated on serine, threonine,
and tyrosine residues, indicating that P-TEN is a dual-specificity phosphatase. In
addition, P-TEN exhibited a high degree of substrate specificity, showing
selectivity for extremely acidic substrates in vitro. Furthermore, we demonstrate
that mutations in P-TEN, identified from primary tumors, tumor cell lines, and a
patient with Bannayan-Zonana syndrome, resulted in the ablation of phosphatase
activity, demonstrating that enzymatic activity of P-TEN is necessary for its
ability to function as a tumor suppressor.


A variety of techniques have been used to identify genes involved in the etiology of
cancer. From these studies, a surprisingly large number of protein tyrosine kinases
(PTKs) have been implicated in carcinogenesis (1). The PTKs are activated by
amplification, deletion, or mutation of important negative regulatory domains, or by
genetic rearrangements that result in the production of activated fusion proteins
(1, 2). Further support for the importance of tyrosine phosphorylation in
oncogenesis comes from the finding that expression of v-crk, a small adaptor protein
that does not contain intrinsic PTK activity, results in an increase in the levels
of cellular phosphotyrosine and cellular transformation (3). The critical role that
tyrosine phosphorylation plays in oncogenesis has led to the suggestion that many
protein tyrosine phosphatases (PTPs) would act as tumor suppressors. Although PTPs
have been linked to the inhibition of cell proliferation, there had been no
clear-cut examples of these enzymes functioning as tumor suppressors.

P-TEN, a candidate tumor-suppressor gene identified on chromosome 10, also known as
MMAC1, shares homology with the PTP family, as well as with the cytoskeletal protein
tensin (4, 5). P-TEN was isolated from a locus on chromosome 10, 10q22-23, which is
deleted in a large number of tumors, especially glioblastomas (4, 5). Subsequently,
it was shown that P-TEN is deleted or mutated in a significant fraction of
glioblastomas and prostate tumors (4, 5). Importantly, germ-line mutations in P-TEN
give rise to Cowden disease, which is typified by the formation of multiple, benign
tumors and an increased susceptibility to malignant cancers. Mutations in P-TEN were
also found in patients suffering from disorders similar to Cowden disease, such as
Lhermitte-Duclos disease, which has additional pathologies, including ataxia,
macrocephaly, and dysplastic cerebellar gangliocytomatosis (6, 7). The detection of
germ-line mutations in four of five Cowden/Lhermitte-Duclos kindreds verified that
P-TEN functions as a tumor suppressor and also suggests that P-TEN plays a role in
the proper development and formation of certain tissues (8). Recent data suggest
that germ-line mutations of P-TEN are also responsible for Bannayan-Zonana syndrome
(C.E., unpublished data), an autosomal dominant disorder that, in addition to the
mental retardation, macrocephaly, and thyroid disease shared with Cowden disease, is
characterized by lipomatosis, speckled penis, and an early onset of the neoplastic
disease (9).

All PTPs contain the catalytic signature motif HCXXGXXRS/T (10). The cysteine
residue in this motif is absolutely required for catalysis, as it acts as a
nucleophile to attack the phosphorus atom in the phosphate moiety of its substrate,
forming a thiol-phosphate intermediate (11). Mutation of this cysteine to serine or
alanine results in the complete loss of phosphatase activity (12). The
dual-specificity phosphatases, which catalyze the hydrolysis of phospho-seryl,
-threonyl, and -tyrosyl residues, also contain the canonical PTP catalytic motif
(13). The crystal structure of PTP1B, the prototypic PTP, reveals that the stringent
amino acid selectivity of the PTPs is determined, in part, by the location of the
catalytic cysteine at the base of a cleft (11, 14). The depth of the cleft (9 Å)
matches the length of a phosphotyrosine residue, and the shorter phospho-seryl and
-threonyl residues are not able to reach the catalytic cysteine. In addition, the
deep cleft is lined with hydrophobic residues that help stabilize the interaction
with the hydrophobic tyrosine moiety (12). In contrast to PTP1B, the
phosphate-binding loop in the dual-specificity phosphatase VHR is at the base of a
much shallower cleft, which can accommodate all three phosphorylated hydroxyl amino
acids (15). Regardless of the depth of the catalytic pocket, both classes of enzymes
proceed through similar steps of catalysis. The initial nucleophilic attack of the
cysteine residue results in the formation of an enzyme-substrate complex. This
complex is disrupted by the catalytic acid (Asp-181 in PTP1B and Asp-92 in VHR),
which protonates the phenolic oxygen of the tyrosyl group, releasing the
dephosphorylated substrate from the complex. The phosphorus atom remains associated
with the active site cysteine as a thiol-phosphate. In the case of the PTPs, the
active enzyme is regenerated by the hydrolysis of the thiol-phosphate bond by a
water molecule, possibly activated by the same aspartate residue (11, 14, 15). The
efficiency of this reaction is illustrated by the finding that PTP1B can undergo
approximately 2,000 catalytic cycles per minute (12).
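
The signature motif HCXXGXXRS/T can be written as a simple sequence pattern. The Python sketch below is an illustration only: it treats X as any residue and S/T as "serine or threonine", and the two toy sequences are hypothetical strings invented for the example, not real protein sequences.

```python
import re

# HCXXGXXRS/T as a regular expression: "." stands for any residue (X),
# and the last position accepts serine (S) or threonine (T).
PTP_SIGNATURE = re.compile(r"HC..G..R[ST]")

def find_signature(protein_sequence):
    """Return (1-based position of the motif cysteine, matched motif), or None."""
    match = PTP_SIGNATURE.search(protein_sequence)
    if match is None:
        return None
    # The cysteine is the second residue of the motif.
    return match.start() + 2, match.group()

# Hypothetical toy sequences (assumptions for illustration only).
toy_wild_type = "MKLVHCAAGAARSLL"
toy_cys_to_ser = "MKLVHSAAGAARSLL"  # motif cysteine replaced by serine

print(find_signature(toy_wild_type))
print(find_signature(toy_cys_to_ser))
```

A cysteine-to-serine substitution of the kind described above removes the match, which mirrors, at the sequence level only, the loss of the catalytic nucleophile.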

Recent data indicate that the PTPs exhibit a great deal of substrate specificity in
vivo and are not simply unregulated antagonists of the signals mediated by the PTKs
(12, 16). In fact, PTPs can exert both positive and negative effects on signaling
pathways, indicating that they do not simply function as "off switches" (17, 18). In
spite of the large number of PTKs that have been shown to play a role in oncogenesis
(1), there were no examples of PTPs that function as classical tumor suppressors.
Although P-TEN has been identified as a tumor suppressor and displays some of the
structural features of the PTP family, its phosphatase activity has not been
characterized. We have expressed P-TEN in Escherichia coli as a glutathione
S-transferase (GST) fusion protein and demonstrated that P-TEN is catalytically
active. Purified GST-P-TEN showed a strong preference for only the most acidic
substrates. Furthermore, P-TEN was able to dephosphorylate serine, threonine, and
tyrosine residues, establishing it as a dual-specificity phosphatase. Finally, we
demonstrated that the phosphatase activity of P-TEN is necessary for its ability to
function as a tumor suppressor, because a variety of point mutations, derived from
tumor samples or Bannayan-Zonana syndrome, ablated P-TEN activity.

MATERIALS  AND  METHODS 

Expression and Purification of GST-P-TEN. A full-length P-TEN cDNA was generated by
ligating the NotI-BglII fragment from EST264611 with the BglII-EcoRI fragment of
EST365465 into pBluescript digested with NotI-EcoRI. The resulting full-length P-TEN
cDNA was amplified by PCR using Pfu polymerase (Stratagene) and primers that add a
5' BamHI site (5'-CGCGGATCCATGACAGCCATCATCAAAGAGATCGTTAGC) and a 3' EcoRI site
(5'-CGCGAATTCTCAGACTTTTGTAATTTGTGTATGC). The resulting fragment was subcloned into
pGEX2T (Pharmacia), and the sequence was verified by automated sequencing.
Expression of P-TEN was induced in 500 ml of mid-log phase bacteria (A600 = 0.600)
by the addition of isopropyl β-D-thiogalactoside to 200 μM. The culture was shifted
to room temperature and expression was allowed to proceed for 12 h. The bacteria
were harvested by centrifugation, the supernatant was removed, and the bacterial
pellet was frozen at -80°C. The frozen pellets were resuspended in 5 ml of ice-cold
20 mM Tris, 150 mM NaCl, and 5 mM EDTA (pH 8.0) supplemented with lysozyme
(1 mg/ml), aprotinin (5 μg/ml), leupeptin (5 μg/ml), and benzamidine (1 mM) and
incubated on ice for 15 min. The bacteria were lysed by sonicating three times for
1 min each with a Branson model 450 sonifier, power setting 4, 70% duty cycle. The
lysate was cleared by centrifugation at 30,000 × g for 10 min and diluted with an
equal volume of HBS (50 mM Hepes/150 mM NaCl, pH 7.4). Glutathione-Sepharose 4B
(300 μl) was added, and the resulting slurry was incubated at 4°C on a rocking
platform for 1-2 h. The glutathione-Sepharose was washed five times, each with 10 ml
of ice-cold HBS, and the washed fusion proteins were eluted with a solution
containing 20 mM glutathione, 50 mM Hepes, and 30% glycerol (pH 8.0). Protein
concentrations were determined by the method of Bradford, using BSA as a standard,
and the integrity of the fusion proteins was verified by SDS/PAGE. Mutations
identified from tumor samples and cell lines [refs. 4 and 5; and L. Hedrick (Johns
Hopkins University) personal communication] or identified from Cowden disease (8)
and Bannayan-Zonana (C.E., unpublished data) were introduced into pGEX2T-P-TEN using
the Quickchange mutagenesis kit as described by the manufacturer (Stratagene). For
all mutations, the entire P-TEN ORF was sequenced to confirm that no other mutations
had been introduced.
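
As a quick consistency check on the two PCR primers quoted above, the following Python sketch locates the added restriction sites and inspects the coding bases next to them. It is an illustration only: the recognition sequences GGATCC (BamHI) and GAATTC (EcoRI) are standard, the primer strings are the sequences given in the text with the line-break hyphens removed, and the helper names are our own.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

def after_site(primer, site):
    """Return the primer bases 3' of the first occurrence of a restriction site."""
    index = primer.find(site)
    if index < 0:
        raise ValueError(f"site {site} not found in primer")
    return primer[index + len(site):]

# Primer sequences as quoted in the text (5' to 3').
forward_primer = "CGCGGATCCATGACAGCCATCATCAAAGAGATCGTTAGC"
reverse_primer = "CGCGAATTCTCAGACTTTTGTAATTTGTGTATGC"

BAMHI_SITE = "GGATCC"
ECORI_SITE = "GAATTC"

# Coding-strand bases contributed by each primer.
forward_coding = after_site(forward_primer, BAMHI_SITE)
reverse_coding = reverse_complement(after_site(reverse_primer, ECORI_SITE))

starts_with_atg = forward_coding.startswith("ATG")
ends_with_stop = reverse_coding[-3:] in {"TAA", "TAG", "TGA"}

print("forward coding bases:", forward_coding, "| ATG start:", starts_with_atg)
print("reverse coding bases:", reverse_coding, "| stop codon:", ends_with_stop)
```

The check confirms only that the BamHI site is followed directly by an ATG and that the EcoRI site sits immediately after a stop codon on the coding strand; it says nothing about annealing temperature or primer specificity.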

Substrate Preparation. All tyrosine-phosphorylated substrates were phosphorylated
with the cytoplasmic fragment of the β subunit of the insulin receptor kinase
(β-IRK) and purified as described (19). Serine-phosphorylated substrates were
phosphorylated with recombinant protein kinase A (New England Biolabs) or with
recombinant casein kinase II (a gift from D. Litchfield, University of Western
Ontario) in a reaction mixture consisting of 50 mM Hepes (pH 7.2), 10 mM MgCl2,
2 mM DTT, 2 mM ATP, and 1 mCi of [γ-32P]ATP (1 Ci = 37 GBq) in a total volume of
1 ml. Casein was used at a concentration of 10 mg/ml, myelin basic protein (MBP) at
4 mg/ml, and the peptides RRRDDDSDDD (DSD) and RRREEETEEE (ETE) were used at a
concentration of 0.5 mg/ml. Protein substrates were precipitated by the addition of
ammonium sulfate to 80%, incubated on ice for 30 min, and harvested by
centrifugation. The precipitated proteins were washed three times with 80% ammonium
sulfate and then resuspended in 500 μl of 1 M Hepes (pH 7.5). The solubilized
proteins were dialyzed against several changes of 50 mM imidazole (pH 7.2). Peptide
substrates were purified using a Sep-pak C18 reverse phase cartridge (Waters) as
described (20). The purified peptides were lyophilized to dryness and resuspended in
50 mM imidazole (pH 7.2). Random copolymers of glutamate and tyrosine, with a 4:1
ratio of glutamate to tyrosine (polyGlu4Tyr1) or with a 1:1 ratio (polyGlu1Tyr1),
were purchased from Sigma and phosphorylated with β-IRK at a final polymer
concentration of 1 mg/ml and purified, as described, using Sep-pak C18 reverse phase
chromatography. Phosphorylated ERK2 was produced in E. coli by coexpression of
activated MEK and was a gift from D. Barford (Oxford University).
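
The labeling conditions above fix the nominal specific radioactivity of the ATP, which is what later converts measured counts into pmol of phosphate. The Python sketch below works through that arithmetic under stated assumptions: it ignores radioactive decay, counting efficiency, and the small mass of the labeled ATP itself, and the function name is hypothetical.

```python
BQ_PER_CI = 3.7e10  # 1 Ci = 37 GBq, as noted in the text

def dpm_per_pmol(label_mci, atp_mm, volume_ml):
    """Nominal specific radioactivity of ATP in disintegrations/min per pmol.

    Assumes no decay and that unlabeled ATP dominates the ATP pool.
    """
    dpm_total = label_mci * 1e-3 * BQ_PER_CI * 60   # mCi -> Bq -> dpm
    atp_pmol = atp_mm * volume_ml * 1e6             # mM x ml = umol; x 1e6 = pmol
    return dpm_total / atp_pmol

# Kinase reaction described above: 1 mCi label, 2 mM ATP, 1 ml total volume.
specific_radioactivity = dpm_per_pmol(label_mci=1.0, atp_mm=2.0, volume_ml=1.0)
print(f"{specific_radioactivity:.0f} dpm per pmol ATP")
```

Under these assumptions the reaction contains 2 μmol of ATP carrying 2.22 × 10^9 dpm, or roughly 1,110 dpm per pmol of transferable phosphate.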

Phosphatase Assays. The standard phosphatase assay contained 10 μM substrate, 50 mM
Hepes (pH 7.0), 10 mM MgCl2, and 10 mM DTT. The reaction was initiated by the
addition of enzyme, typically 1-2 μg, to prewarmed (30°C) substrate mix, resulting
in a final volume of 60 μl. The reactions were allowed to proceed at 30°C for the
indicated times and stopped by the addition of a suspension of activated charcoal in
900 mM HCl, 90 mM NaPPi, and 2 mM NaPi (21). Dephosphorylation of ERK2 was performed
essentially as described for the radioactive substrates; at the indicated times
duplicate aliquots were removed and stopped by the addition of 5× Laemmli sample
buffer and processed for immunoblot analysis as described (20). Immunoblots were
probed with a 1:5,000 dilution of anti-mitogen-activated protein (MAP) kinase
ascites or with a mixture of anti-phosphotyrosine antibodies [1:2,000 dilution of
G-104 and G-2-98 ascites (22)] and were developed with ECL reagents (Amersham) in
conjunction with a horseradish peroxidase-labeled rat anti-mouse κ-chain antibody
(Zymed).
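
Activities in this paper are reported as pmol of phosphate released per min per mg of enzyme. The Python sketch below shows how such a value follows from the assay quantities described above. It is a minimal illustration: the example reading (48.4 pmol released in 10 min by 1 μg of enzyme) is hypothetical, and the function names are our own.

```python
def substrate_pmol(concentration_um, volume_ul):
    """Total substrate in the assay; uM x ul gives pmol."""
    return concentration_um * volume_ul

def specific_activity(pmol_released, minutes, enzyme_ug):
    """Phosphate released in pmol/min per mg of enzyme."""
    enzyme_mg = enzyme_ug / 1000.0
    return pmol_released / minutes / enzyme_mg

# Standard assay from the text: 10 uM substrate in a final volume of 60 ul.
total_substrate = substrate_pmol(10, 60)

# Hypothetical reading, chosen only to illustrate the unit conversion.
example_activity = specific_activity(pmol_released=48.4, minutes=10, enzyme_ug=1.0)
fraction_consumed = 48.4 / total_substrate

print(f"substrate per assay: {total_substrate} pmol")
print(f"specific activity: {example_activity:.0f} pmol/min per mg")
print(f"fraction of substrate consumed: {fraction_consumed:.1%}")
```

Keeping the fraction of substrate consumed small is a common way to stay in the linear range of an assay; whether that applied to each measurement here is not stated in the text.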

RESULTS 

Although P-TEN displays structural features of members of the PTP family of enzymes,
an essential step toward understanding the function of P-TEN is the characterization
of its enzymatic activity. To test whether P-TEN encodes an active phosphatase, we
expressed it as a fusion protein in E. coli. As a control, we assayed all substrates
with a mutant of P-TEN in which the essential cysteine from the signature motif had
been mutated to serine (P-TEN C124S). This mutant would be expected to be
catalytically inactive, and it ensures that any activity seen was the result of
P-TEN and not of a copurifying bacterial phosphatase.

PTP Activity of P-TEN. Initially, we assayed P-TEN against a number of
tyrosine-phosphorylated proteins and peptides, including reduced
carboxyamidomethylated and maleylated lysozyme (RCML), MBP, polyGlu4Tyr1, and
polyGlu1Tyr1, as well as the peptide EDNDYINASL. The activity of P-TEN toward the
classical substrates RCML and MBP was weak (<80 pmol/min per mg). However, P-TEN
exhibited robust phosphatase activity when polyGlu4Tyr1 was used as a substrate
(4,840 pmol/min per mg) (Table 1 and Fig. 1A). Significantly, addition of
unphosphorylated polyGlu4Tyr1 to reactions using RCML as a substrate did not result
in an increase in activity toward RCML (data not shown), indicating that P-TEN was
not activated by the polyanionic character of polyGlu4Tyr1. In fact, inclusion of
unphosphorylated polyGlu4Tyr1 in reactions containing RCML resulted in the
inhibition of the already limited dephosphorylation of RCML, suggesting that even
unphosphorylated polyGlu4Tyr1 was capable of binding to P-TEN, displacing the more
weakly interacting RCML. We also observed that P-TEN dephosphorylated polyGlu1Tyr1.
When the stoichiometries of phosphorylation were normalized, no significant
differences were detected in the rates of dephosphorylation of polyGlu4Tyr1 and
polyGlu1Tyr1, suggesting that the acidic character of these substrates is an
important determinant of substrate recognition. P-TEN, however, exhibited reduced
activity when assayed with the acidic peptide EDNDYINASL (Table 1), suggesting that
the mere presence of acidic residues was not sufficient to ensure that a peptide
would be a substrate. cdc14, a dual-specificity phosphatase that is closely related
to P-TEN (8), does not discriminate between RCML and polyGlu4Tyr1 (Fig. 1B),
indicating that polyGlu4Tyr1 is not a universal substrate for the dual-specificity
phosphatases, and that the substrate specificity of P-TEN is likely to be a unique
feature determined by structural motifs separate from the catalytic motif. Caution
should be exercised when comparing the activity of P-TEN to that of other
dual-specificity phosphatases because of the inherent substrate specificity
exhibited by these enzymes.

Fig. 1. Tyrosine phosphatase activity of purified P-TEN. (A) P-TEN was tested for
protein phosphatase activity using the indicated tyrosine-phosphorylated substrates.
Activity is expressed as pmol of phosphate released. A catalytically inactive mutant
of P-TEN (P-TEN C124S) was included as a control to rule out the possibility of
contaminating bacterial phosphatases. (B) Comparison of P-TEN and cdc14 activities.
P-TEN and cdc14 were assayed as above with RCML or polyGlu4Tyr1, and the activity is
expressed as pmol of phosphate released per min per mg.

Dual-Specificity Phosphatase Activity of P-TEN. To test whether P-TEN falls into the
class of dual-specificity phosphatases, we assayed activity using a number of
proteins and peptides phosphorylated on serine and threonine residues. Similar to
the findings with tyrosine-phosphorylated substrates, P-TEN dephosphorylated
serine/threonine residues in substrates that had a preponderance of acidic residues.
Specifically, P-TEN dephosphorylated two peptide substrates (DSD and ETE) with the
highest efficiency (Table 1 and Fig. 2). Furthermore, P-TEN showed specificity even
among acidic serine/threonine substrates, exhibiting a reduced activity when casein,
phosphorylated by casein kinase II or protein kinase A, was used as substrate
(Table 1 and Fig. 2). As might be anticipated in light of these properties, P-TEN
exhibited almost undetectable activity when assayed with polybasic substrates, such
as MBP or Kemptide (LRRASLG) (Fig. 2 and data not shown). The finding that P-TEN
dephosphorylated two peptide substrates of casein kinase II also indicates that the
inability of P-TEN to dephosphorylate EDNDYINASL does not simply reflect its
inability to dephosphorylate small peptide substrates. Although many
dual-specificity phosphatases show a preference for tyrosine residues, it is
possible that the reduction in the activity of P-TEN toward serine/threonine
substrates may be the result of these residues being located in a suboptimal
substrate backbone.

Table 1. Activity of P-TEN measured with tyrosine, serine, and threonine
phosphorylated substrates

Substrate                                   Activity, pmol/min per mg
Phosphotyrosyl substrates
  polyGlu4Tyr1                              4,840 ± 140
  RCML                                      88 ± 6.6
  EDNDYINASL peptide                        51 ± 4.0
  MBP                                       21 ± 1.2
Phospho-seryl and -threonyl substrates
  DSD peptide                               210 ± 4.2
  ETE peptide                               161 ± 1.8
  Casein (casein kinase II)                 14.2 ± 1.2
  Casein (protein kinase A)                 27.5 ± 2.2
  MBP                                       11.3 ± 0.6

P-TEN activity was measured with the indicated substrates in triplicate. The
activity is expressed as the mean pmol phosphate released per min per mg enzyme
± SD.
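
The substrate preference summarized in Table 1 is easier to compare as fold differences relative to the best substrate. The Python sketch below uses the mean activities listed in Table 1 and ignores the reported standard deviations; the dictionary keys and variable names are our own labels, not terms from the paper.

```python
# Mean activities from Table 1, in pmol/min per mg enzyme.
table1_activity = {
    "polyGlu4Tyr1": 4840.0,
    "RCML": 88.0,
    "EDNDYINASL peptide": 51.0,
    "MBP (phosphotyrosyl)": 21.0,
    "DSD peptide": 210.0,
    "ETE peptide": 161.0,
    "Casein (casein kinase II)": 14.2,
    "Casein (protein kinase A)": 27.5,
    "MBP (phospho-seryl/-threonyl)": 11.3,
}

reference = table1_activity["polyGlu4Tyr1"]

# How many times faster polyGlu4Tyr1 is dephosphorylated than each substrate.
fold_below_reference = {
    name: reference / activity for name, activity in table1_activity.items()
}

for name, fold in sorted(fold_below_reference.items(), key=lambda item: item[1]):
    print(f"{name:32s} {fold:7.1f}-fold")
```

On these figures polyGlu4Tyr1 is dephosphorylated about 55 times faster than RCML and about 23 times faster than the best serine/threonine substrate, the DSD peptide.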

Dephosphorylation of ERK2. A number of dual-specificity phosphatases show a
preference for members of the MAP kinase family (13, 23, 24). Therefore, we tested
whether P-TEN could dephosphorylate ERK2. We could not detect dephosphorylation of
ERK2 by P-TEN, either by changes in anti-phosphotyrosine antibody reactivity or by
changes in the electrophoretic mobility of ERK2. In contrast, MAP kinase
phosphatase 1 (MKP-1) quickly and completely dephosphorylated ERK2, as shown by the
removal of phosphotyrosine from ERK2 and an increase in its electrophoretic mobility
(Fig. 3 A and B, respectively).

Fig. 2. Dual-specificity phosphatase activity of purified P-TEN. P-TEN was tested
for protein phosphatase activity using the indicated serine/threonine-phosphorylated
substrates. Activity is expressed as pmol of phosphate released. A catalytically
inactive mutant of P-TEN (P-TEN C124S) was included to control for contaminating
phosphatases. Casein was phosphorylated, as indicated, with casein kinase II (CKII)
or protein kinase A (PKA).

Fig. 3. P-TEN does not dephosphorylate ERK2. Phosphorylated ERK2 was incubated with
P-TEN or MKP-1 for the indicated times and ERK2 assayed for residual phosphotyrosine
by immunoblotting (A) with an anti-phosphotyrosine antibody or (B) by immunoblotting
with anti-MAP kinase antibodies to visualize the change in electrophoretic mobility.
[Immunoblot panels: (A) blot: anti-phosphotyrosine; (B) blot: anti-ERK2. Lanes:
+ P-TEN and + MKP-1, each at 5, 10, 20, and 30 min; the ERK2 band is marked.]

Effects of Point Mutations on the Activity of P-TEN. To test whether the activity of
P-TEN was altered during tumorigenesis, we assessed the effect on activity of a
variety of point mutations that occur in P-TEN isolated from tumor specimens. Many
of these mutations occur in or near the catalytic motif of P-TEN, including
His-123 → Tyr (H123Y), Gly-129 → Arg (G129R) or Gly-129 → Glu (G129E), and
Met-134 → Leu (M134L). In addition, several mutations were found that occurred
outside the conserved catalytic motif, including Leu-57 → Trp (L57W) and a cluster
of mutations C terminal to the catalytic loop [Gly-165 → Arg (G165R), Thr-167 → Pro
(T167P), and Ser-170 → Arg (S170R)]. The exact positions of these mutations in P-TEN
are indicated in Fig. 4. We produced GST fusion proteins in which P-TEN was mutated
at each of these positions to mimic the mutant alleles, and the phosphatase activity
of the resulting recombinant proteins was measured using polyGlu4Tyr1 as a substrate
(Fig. 5). With the exceptions of the M134L and the G129E mutations, all the
mutations tested resulted in a dramatic decrease in the activity of P-TEN (Fig. 5).
These data indicate that the catalytic activity of P-TEN has been disrupted in the
majority of these tumors.

The activity exhibited by the G129E mutation was surprising, especially in light of
the finding that mutation of this same residue to arginine (G129R) resulted in a
significant reduction of P-TEN activity. Other potential effects of this mutation on
P-TEN, such as changes in protein stability in the complex environment of the
mammalian cell or changes in substrate specificity, have not been addressed. On the
other hand, the essentially wild-type activity exhibited by the M134L mutation [...]
phosphatases contain a leucine residue at this position (25).
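
The point mutations in this section are written in two equivalent notations, a compact one-letter code (for example, G129R) and a three-letter form (Gly-129 to Arg). The Python sketch below converts the compact code into the long form. It is an illustration only: the function name `describe_mutation` is hypothetical, and the ASCII arrow "->" stands in for the arrow used in the text.

```python
import re

# Standard one-letter to three-letter amino acid codes.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}

MUTATION_CODE = re.compile(r"^([A-Z])(\d+)([A-Z])$")

def describe_mutation(code):
    """Expand a code such as 'G129R' into 'Gly-129 -> Arg'."""
    match = MUTATION_CODE.match(code)
    if match is None:
        raise ValueError(f"not a point-mutation code: {code!r}")
    wild_type, position, mutant = match.groups()
    return f"{THREE_LETTER[wild_type]}-{position} -> {THREE_LETTER[mutant]}"

# Mutations named in this section, plus the C124S control.
tested = ["H123Y", "G129R", "G129E", "M134L", "L57W", "G165R", "T167P", "S170R", "C124S"]
expanded = [describe_mutation(code) for code in tested]

for code, text in zip(tested, expanded):
    print(f"{code}: {text}")
```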

DISCUSSION

The most common chromosomal deletion in glioblastoma occurs around 10q22-23,
suggesting the presence of a tumor suppressor at this locus (26). Mapping of the
deletions at 10q22-23, as well as representational difference analysis ( ), led to
the identification of P-TEN as the tumor suppressor residing at this locus (4, 5).
P-TEN shares sequence homology with the cytoskeletal protein tensin and with the
family of PTPs (4, 5). A substantial fraction of all glioblastoma samples tested
have either deleted or mutated P-TEN alleles. In addition, P-TEN has been shown to
be disrupted in a large number of breast and prostate tumor samples (4, 5).
Importantly, germ-line mutations in P-TEN give rise to a variety of genetic
disorders, most notably Cowden disease, a disorder characterized by the formation of
multiple benign tumors (hamartomas), as well as an increased susceptibility to
malignant cancers of the breast and thyroid (28), confirming that P-TEN functions
similarly to classical tumor-suppressor genes (8). Significantly, many of the
mutations isolated from tumor samples, as well as from Cowden disease and
Bannayan-Zonana syndrome, were predicted to disrupt the phosphatase [...] whether
P-TEN is a functional phosphatase or whether these [...]

[...] measured the enzymatic activity of recombinant P-TEN. Initially we were
frustrated by the low activity of the fusion proteins when assayed with proteins
that are commonly used as PTP substrates, such as RCML or MBP. Recently, another
group has also reported low activity when RCML was used as a substrate (29).
However, we demonstrate that P-TEN exhibits robust phosphatase activity when assayed
with a random copolymer of glutamate and tyrosine, polyGlu4Tyr1, suggesting that it
exhibits an unusual substrate specificity. P-TEN did not efficiently dephosphorylate
the acidic peptide EDNDYINASL, suggesting that the presence of acidic residues may
not be sufficient to create an ideal substrate and that higher order structures,
such as those found in protein substrates, may be required.

Fig. 4. Location of P-TEN mutations. A diagram of P-TEN showing the locations of the point mutations, indicated by an *, that were tested
in this study. In addition, the predicted structural motifs (see text) in which these mutations lie are also indicated.

On the basis of the presence of the AYLL/M motif [...]
in many dual-specificity phosphatases, as well as the absence
of some of the sequence motifs found in PTPs, [...]
those that contribute to creating a deep substrate [...] specificity
(30), one would predict that P-TEN is a dual-specificity
phosphatase. P-TEN was shown to exhibit activity against
serine/threonine,  as  well  as  tyrosine-phosphorylated  proteins 
(Table  1),  verifying  that  it  is  a  dual-specificity  phosphatase. 
As with the tyrosine-phosphorylated substrates, P-TEN
dephosphorylated only the most acidic serine/threonine-phosphorylated
substrates tested. Even the best serine/threonine substrates,
the ETE and DSD peptides, were not as efficiently
dephosphorylated as polyGlu4Tyr1 (Table 1). Casein,
phosphorylated by casein kinase II, was dephosphorylated poorly
by P-TEN (Table 1 and Fig. 2), suggesting that P-TEN may
require multiple acidic residues positioned both N and C
terminally to the phosphorylated residue. It is currently
unclear if the apparent amino acid selectivity of P-TEN is an
intrinsic characteristic or if it is caused by differences in the
affinity of P-TEN for polyGlu4Tyr1 and the ETE or DSD
peptides, which may reflect a preference for protein over peptide
substrates.

Fig. 5. Disruption of P-TEN activity by point mutations found in
tumor samples. The indicated point mutations were introduced into
recombinant P-TEN and their effects on phosphatase activity were
determined. Assays were performed with polyGlu4Tyr1 for 15 min.
Activity is expressed as pmol of phosphate liberated per min per mg
of P-TEN. Assays were performed in triplicate and are expressed as
the mean ± SD. The catalytically inactive mutant of P-TEN
(P-TENC124S) was included as a control to rule out the possibility of
contaminating bacterial phosphatases.

Mutations that occur in P-TEN during tumorigenesis fall
into three large classes: (i) genomic deletions encompassing all
or most of P-TEN, (ii) frameshift mutations resulting in the
production of truncated P-TEN proteins, and (iii) point
mutations resulting in the substitution of one amino acid for
another (4, 5). We have chosen to introduce a variety of the
point mutations found in tumor samples into P-TEN, because
they are less likely than the frameshift deletions to cause
unpredictable changes in the conserved secondary structure
found in all PTPs (30). Comparisons of P-TEN with other
phosphatases whose crystal structures have been solved aid
in predicting how these point mutations might disrupt P-TEN
activity. For example, the mutations His-123 → Tyr (H123Y)
and Gly-129 → Arg, affecting two residues located in the catalytic
motif, resulted in the complete loss of phosphatase activity. One
would have predicted that the H123Y mutation, detected in an
endometrial cancer, would be catastrophic to activity because
of the critical importance of this histidine in correctly orienting
the catalytic cysteine so it can act as a nucleophile (11).
Although Gly-129 is not as highly conserved as other residues
found  in  the  catalytic  motif,  the  substitution  of  the  large, 
positively  charged  side  chain  of  arginine,  a  mutation  found  in 
a  glioblastoma  cell  line,  for  the  much  smaller  glycine,  is  likely 
to  have  a  deleterious  effect  on  the  overall  structure  of  the 
phosphate-binding  loop  and  is  likely  to  impede  the  binding  of 
phosphate.  In  contrast,  a  mutation  that  was  identified  in  a 
prostate  tumor  cell  line,  in  which  Met-134  was  changed  to  Leu 
(M134L),  had  no  effect  on  P-TEN  activity.  However,  among 
the  dual-specificity  phosphatases,  leucine  is  commonly  found 
at  this  position  (25),  suggesting  that  the  M134L  allele  may 
represent  a  naturally  occurring  polymorphism  rather  than  a 
mutation. 

Several other mutations that reside outside of the catalytic
motif were also tested for their effect on P-TEN activity. A
point mutation discovered in a glioblastoma sample that
changes Leu-57 → Trp (L57W) also eliminated P-TEN
phosphatase activity. This amino acid is located in a conserved
α-helix (α2), which is found in both PTP1B and VHR.
Although not directly involved in catalysis, this helix helps
form the overall secondary structure of the enzyme (14, 15).
Similarly, mutation of residues in this helix of LAR, a receptor
PTP, also resulted in a significant loss of phosphatase activity
(31). A second cluster of point mutations was discovered in the
last conserved structural motif found in most PTPs and
dual-specificity phosphatases, which also is an α-helix (14, 15).
In YopH, a PTP isolated from the causative agent of bubonic
plague, a hydrophobic residue in this α-helix is important for
coordinating the water molecule necessary for regenerating
the active enzyme (15). All three mutations found in this
region, Gly-165 → Arg (found in a glioblastoma), Thr-167 →
Pro (found in a breast cancer), and Ser-170 → Arg (found in a
patient with Bannayan-Zonana syndrome), resulted in the
loss of P-TEN activity. The G165R and the S170R mutations
would result in the replacement of relatively small, uncharged
amino acids by a much larger positively charged residue,
potentially disrupting important interactions between this
α-helix and surrounding structures. Moreover, the substitution
of Thr-167 → Pro is also likely to disrupt these interactions by
interrupting the proper folding of this α-helix. These data
indicate  that  this  conserved  helix  (al  in  YopH  and  VHR)  is  a 
required  motif  in  the  dual-specificity  phosphatases. 

All of the mutations tested were derived from tumor samples
and, with two exceptions, resulted in the complete loss of, or a
great reduction in, the enzymatic activity of P-TEN, even when the
mutations resided well outside of the conserved catalytic
motif. These observations indicate that the inhibition of
enzymatic activity was required for the progression of these cells
to a cancerous state.

M134L was one of two mutations found in which activity was
not lost. However, leucine is commonly found at this position
in other dual-specificity phosphatases (25) and the wild-type
allele was not lost in the tumor from which this mutation
derived. Hence, M134L is likely to be a naturally occurring
polymorphism.

We  were  surprised  by  the  finding  that  the  G129E  mutation, 
a  mutation  found  in  the  germ  line  of  patients  suffering  from 
Cowden  disease,  did  not  disrupt  P-TEN  activity,  whereas  a 
mutation of this same residue to arginine (found in a glioblastoma)
ablated P-TEN activity. Although we were unable to
detect a change in the activity of P-TEN carrying the G129E
mutation, it is likely that this mutation affects the ability of
P-TEN to function in vivo, as this mutation has been isolated
from two independent Cowden disease kindreds but not from
unaffected patients (8). Moreover, deletions of the wild-type
allele were found in the tumors that develop in this Cowden
disease kindred (8). Although the G129E mutation does not
affect activity when measured in vitro using artificial substrates,
this mutation may interfere with the ability of P-TEN to
dephosphorylate its physiological targets. Furthermore, this
mutation may decrease the stability or half-life of P-TEN in the
complex environment of the mammalian cell. A full
understanding of the effects of this mutation on P-TEN will require
the  identification  of  its  physiological  substrates,  regulators  and 
perhaps  even  the  elucidation  of  the  structural  changes  elicited 
by  this  amino  acid  substitution. 

Even  though  the  sample  size  is  small,  there  does  appear  to 
be  a  correlation  between  the  severity  in  the  disruption  of 
P-TEN  activity  and  the  pathology  of  the  disease.  In  contrast  to 
the G129E mutation, the S170R mutation exhibited a significant
reduction in activity and was isolated from a patient
suffering  from  a  severe  disorder,  Bannayan-Zonana,  that 
shares  many  features  with  Cowden  disease,  but  also  manifests 
additional  pathologies,  including  early  onset  of  the  disease  at 
birth  (9). 

To understand how P-TEN functions as a tumor suppressor,
it will be necessary to identify its physiological substrates. Most
dual-specificity phosphatases dephosphorylate and inactivate
the MAP kinases (13, 23, 24), and moreover, the MAP kinases
are found to be hyperphosphorylated in breast cancer (32), a
cancer that frequently contains P-TEN mutations or deletions
(8). Therefore, we tested if P-TEN could dephosphorylate the
MAP kinase ERK2. However, our data indicate that P-TEN
was incapable of dephosphorylating ERK2 in vitro, strongly
suggesting that the MAP kinases are not regulated by P-TEN
in vivo. Thus, the physiological substrates of P-TEN are still
unknown. The unique substrate specificity of P-TEN, however,
should aid in identifying its in vivo substrates. Importantly, the
requirement for acidic residues may be satisfied by the
presence of phosphorylated residues, suggesting that multiply
phosphorylated proteins may serve as P-TEN substrates.

ABSTRACT The isolation of genes from a given genomic
region  can  be  a  rate-limiting  step  in  the  discovery  of  disease 
genes.  We  describe  an  approach  to  the  isolation  of  cDNAs  that 
have  sequences  in  common  with  large  genomic  clones  such  as 
bacterial  artificial  chromosomes.  We  applied  this  method  to 
loci  both  amplified  and  deleted  in  cancer,  illustrating  its  usage 
in  the  identification  of  both  oncogenes  and  tumor  suppressor 
genes, respectively. The method, called rapid isolation of
cDNAs by hybridization (RICH), depends on solution hybridization,
enzymatic modification, and amplification/selection
of sequences present in both cDNA populations and the
genomic  clones.  The  method  should  facilitate  the  development 
of  transcription  maps  for  large  genomic  clones,  possibly  even 
yeast  artificial  chromosomes. 


Powerful  methods  have  facilitated  the  localization  of  disease 
genes  to  regions  of  the  genome.  Typically,  candidate  regions  are 
contained  on  large  yeast  or  bacterial  cloning  vectors,  and  these 
vectors  must  be  searched  assiduously  by  various  means  for 
candidate  genes.  This  step  has  often  proved  to  be  an  obstacle  in 
gene  discovery.  The  problem  has  been  attacked  in  roughly  three 
ways:  by  hybridization  (1-3),  sequence  analysis  (4,  5),  and  exon 
trapping methods (6-8). Each method has its own particular
advantages,  but  no  current  method  is  without  serious  problems. 

We  present  herein  a  method  we  call  rapid  isolation  of 
cDNAs  by  hybridization  (RICH)  based  on  the  identification  of 
sequences  in  common  between  a  cDNA  library  and  a  large 
clone  of  genomic  DNA.  The  method  selects  and  amplifies 
those restriction endonuclease fragments of cDNAs that hybridize
precisely at one end to the end of a similarly cleaved
genomic DNA fragment. Before hybridization, the cDNA and
genomic fragments are modified with different adaptors.
Those cDNAs that form hybrids with genomic DNA at one or
both ends are ligated to a “selection adaptor” that is complementary
to the genomic adaptor and contains an additional
sequence  complementary  to  an  RNA  polymerase  site.  Such 
cDNAs  can  be  selectively  amplified  by  successive  treatments 
with  RNA  and  DNA  polymerases. 

We illustrate the basic method with two series of experiments:
(i) a search for transcripts from the c-MYC locus in
cDNAs from a breast cancer cell line and (ii) a search for
transcripts from the PTEN tumor suppressor locus in cDNAs
from normal breast tissue. Although the method is complex, in
that many different enzymes (restriction endonucleases, various
DNA ligases, RNA polymerase, various DNA endo- and
exonucleases, reverse transcriptase, and various DNA polymerases)
are used, they are all robust enzymes that are readily
available. Only 3 days are required to yield candidates for
further study.


MATERIALS AND METHODS

Materials. The oligonucleotides synthesized for this research
are listed in Table 1 and were obtained from Biosynthesis
(Lewisville, TX). P1 clone 8001 (Genome Systems, St.
Louis) is an 80-kb genomic clone that contains exons 1-3
of the c-MYC gene. A bacterial artificial chromosome (BAC)
clone, 60C5, containing exons 4-9 of PTEN (nucleotides
1,244-2,246 of the published cDNA, GenBank
accession no. U92436) was obtained from Genome Systems.
The plasmid pUC18, digested with BamHI and treated with
bacterial alkaline phosphatase, was supplied by Amersham.
pCR-Script SK(+) and Epicurian Coli XL2-Blue cells were
obtained from Stratagene. SKBr3 is a breast cancer cell line
from which poly(A)+ RNA was extracted with the FastTrack
kit (Invitrogen). cDNA was synthesized from the
poly(A)+ RNA of SKBr3 and from commercially available poly(A)+
RNA of mammary gland tissue (CLONTECH) by using the
Copy kit (Invitrogen). The Megascript kit was from Ambion
(Austin, TX).

The enzymes and their specific buffers used in RICH were
obtained from the following suppliers: Sau3AI and T4 DNA
ligase from New England Biolabs; Stoffel fragment, AmpliTaq,
and AmpliTaq Gold from Perkin-Elmer; Ampligase from
Epicentre Technologies (Madison, WI); Pfu DNA polymerase
and RNase-free DNase from Stratagene; and λ exonuclease
from Amersham.

Glycogen was obtained from Boehringer Mannheim.
GeneQuant  G-50,  S-300HR,  and  S-400HR  columns  and 
Sephaglas  BandPrep  kit  were  obtained  from  Amersham. 
RNase-free  water  was  supplied  by  Ambion.  Maxi  prep  kit  was 
supplied  by  Qiagen  (Hilden,  Germany).  Phenol  was  prepared 
as  described  elsewhere  (9). 

RICH  Standard  Protocol.  To  facilitate  description  of  the 
RICH  protocol,  we  have  broken  up  the  procedure  into  discrete 
units,  labeled  those  units  with  an  alphabetic  letter,  and  assigned 
that  letter  both  to  the  procedure  and  its  final  product  (Fig.  1). 

A. Preparation of genomic DNA. For the preparation of
genomic DNA, 1 μg of BAC DNA is digested with 20 units of
Sau3AI. The digests are purified by phenol/chloroform extraction
and ethanol precipitation. The whole digests are
mixed with 100 pmol of pR-24 and 100 pmol of R-12
oligonucleotides (Table 1) in 20 μl of 1× T4 DNA ligase buffer. The
digested DNA and the oligonucleotides are heated at 65°C
for 5 min, annealed by cooling down the mixture to 4°C gradually,
and then ligated by overnight incubation with 2,000 units of T4
DNA ligase at 11°C. The excess oligonucleotides are removed
from the ligated fragments with S-400HR columns.

B.  Preparation  of  selection  adaptor.  The  selection  adaptor  is 
synthesized,  phosphorylated  at  its  5' end,  and  purified  by 
PAGE.  The  working  concentration  of  the  adaptor  is  50  nM. 


Abbreviations: RICH, rapid isolation of cDNA by hybridization; BAC,
bacterial artificial chromosome; RIGHT, rapid isolation of genes by
hybridization to transcripts.

Table 1. Oligonucleotide sequences for RICH

Oligo      Sequence                                       5' end
N-12       GATCTTCCCTCG                                   Dephosphorylated
R-12       GATCTGCGGTGA                                   Dephosphorylated
pN-24      AGGCAACTGTGCTATCCGAGGGAA                       Phosphorylated
pR-24      AGCACTCTCCAGCCTCTCACCGCA                       Phosphorylated
pR(+)SP6   CTGCGGTGAGAGGCTGGAGAGTGCTCTATAGTGTCACCTAAAT    Phosphorylated
SP6        TATTTAGGTGACACTATAGAGCA                        Dephosphorylated

C. Preparation of cDNA. Double-stranded cDNA is synthesized
from 5 μg of poly(A)+ RNA with the Copy kit. One
microgram of cDNA is digested with 20 units of Sau3AI and
ligated to 100 pmol of pN-24 and 100 pmol of N-12
oligonucleotides in a 20-μl reaction volume as described above. A
partial fill-in mixture is made by combining 21 μl of water, 5
μl of 10× Stoffel buffer, 1 μl of 10 mM dATP, 1 μl of 10 mM
dGTP, and 1 μl of 10 mM dTTP and is added to the ligation
mixture. After incubation at 72°C for 5 min, 2 units of Stoffel
fragment are added to the mixture and incubated at 72°C for
an additional 5 min. The fragments are extracted twice with 50 μl
of phenol/chloroform. The aqueous phase is transferred to a
new tube.

D. Hybridization. Ten microliters of genomic fragments
(component A), 5 μl of R(+)SP6 (component B), and 25 μl of cDNA
fragments (component C) are mixed and then extracted twice
with 40 μl of phenol/chloroform. Ten microliters of 10 M
ammonium acetate, 1 μl of glycogen (20 mg/ml), and 92 μl of
ethanol are added. After centrifugation and washing with 70%
ethanol, the DNA is dissolved in 4 μl of 3× EE buffer (10). The
solution is overlaid with 30 μl of mineral oil and denatured by
incubation in boiling water for 5 min. One microliter of 5 M NaCl
is added and the DNA is reannealed for 16 h at 67°C.

E. Ligation of selection adaptor. To ligate the selection
adaptor to the cDNA fragments, 6 μl of 10× Ampligase buffer
and 1 μl of Ampligase (100 units/μl) are added to 48 μl of
water, and this ligation mixture is incubated at 67°C for 3 min
before adding to the solution (component D). The total
mixture is then incubated for 4 h at 67°C. After ligation, the
DNA is extracted with 60 μl of phenol/chloroform twice,
precipitated by ethanol as above, and dissolved in 50 μl of TE
buffer (10 mM Tris-HCl, pH 7.5/1 mM EDTA). Excess
R(+)SP6 is removed by S-400HR after denaturation.

A  Preparation of genomic DNA
B  Preparation of selection adaptor
C  Preparation of cDNA
D  Hybridization
E  Ligation of selection adaptor
F  Amplification and attenuation
G  RNA transcription and attenuation
H  RT-PCR amplification
I  Cloning of RICH products and analysis

Fig. 1. Flowchart of RICH. Each boldface letter represents a unit
of the RICH procedure described in the standard RICH protocol.

F. Amplification and attenuation. Five microliters of the
DNA solution (component E) is mixed with 50 μl of reaction
mixture [1× AmpliTaq buffer/all four dNTPs (each at 200
nM)/1 μM pN-24/1 μM SP6/1 unit of AmpliTaq Gold]. The
DNA is amplified by a PCR program that specifies an incubation
for 10 min at 94°C followed by 10 cycles of 1 min at 94°C,
1 min at 60°C, and 3 min at 72°C. After purification with S-300HR
columns, the DNA is digested at 37°C with 10 units of λ
exonuclease. Eighty microliters of TE is added to the reaction
mixture and incubated in boiling water for 5 min. The 100-μl
PCR mixture contains 8 μl of the DNA, all four dNTPs (each
at 200 μM), 1 μM pN-24, 1 μM SP6, and Pfu buffer at a 1×
final concentration. After initial denaturation at 94°C for 5
min, 2.5 units of Pfu polymerase is added to the mixture and
20 cycles at 94°C for 1 min and 70°C for 4 min are performed,
followed by a final extension at 72°C for 10 min. The PCR
products are purified with phenol/chloroform extraction and
ethanol precipitation. The DNA is dissolved in 25 μl of
RNase-free water after washing with 70% ethanol.

G. RNA transcription and attenuation. RNA is transcribed
from 8 μl of the DNA template (component F) with the
Megascript kit. After a 6-h incubation at 37°C, 20 units of
RNase-free DNase I is added and the incubation is continued
for an additional 10 min. The RNA is purified with
phenol/chloroform extraction and dissolved in 15 μl of RNase-free
water after ethanol precipitation.

H. Reverse transcription-coupled PCR amplification. To
synthesize cDNA, 1.3 μl of 50 μM pN-24 is added to the mixture
(component G), incubated at 65°C for 10 min, and then placed
at room temperature for 5 min. Double-stranded cDNA is
synthesized by the Copy kit following the supplier’s protocol.
After the products are denatured by heating to 94°C for 10 min,
they are purified with S-300HR columns. The 200-μl PCR
mixture contains 10 μl of the cDNA, all four dNTPs (each at
200 μM), 1 μM pN-24, 1 μM pR-24, 5 units of AmpliTaq Gold,
and AmpliTaq buffer at a 1× final concentration. After initial
denaturation at 94°C for 10 min, 30 cycles at 94°C for 1 min and
72°C for 3 min are performed, followed by a final extension
at 72°C for 10 min.

I. Cloning and sequencing. The RICH products were purified
with GeneQuant G-50 columns and digested with 40 units of
Sau3AI in a 120-μl reaction volume at 37°C for 4 h. The digests
were size-fractionated by electrophoresis through an agarose
gel for Southern blot analysis and for cloning. The DNA bands
were excised, purified with the Sephaglas BandPrep kit, and
cloned into pUC18 digested with BamHI and treated with
bacterial alkaline phosphatase. Epicurian Coli XL2-Blue cells
were transformed with the plasmid. The plasmid DNA was
purified with the Maxi prep kit and sequenced by the Cycle
Sequencing system (Perkin-Elmer).

J. Southern blot analysis. A portion of the RICH product was
electrophoresed through a 4% agarose gel and transferred to
Hybond N+ (Amersham). Hybridization with the PTEN
cDNA probe was detected by the ECL system (Amersham).

K. Making representations of genomic clones. In one variation
of the basic method, we used representations of genomic
clones. To make the representation, 100 ng of BAC DNA is
digested with 5 units of Sau3AI. The digests are purified by
phenol/chloroform extraction and ethanol precipitation. The
whole digests are mixed with 100 pmol of pR-24 and 100 pmol
of R-12 oligonucleotides (Table 1) in 20 μl of 1× T4 DNA
ligase buffer. The digested DNA and the oligonucleotides
are heated at 65°C for 5 min, annealed by cooling the
mixture to 4°C gradually, and then ligated by overnight
incubation with 2,000 units of T4 DNA ligase at 11°C.

RESULTS 

The  Basic  Method.  For  convenience,  we  break  the  method 
into  two  parts.  In  part  one,  we  add  a  selection  adaptor  to  only 
those  cDNA  fragments  that  hybridize  at  one  end  to  an  end  of 
a  genomic  fragment.  In  part  two,  we  selectively  amplify  only 
those  modified  cDNA  fragments.  The  complexity  of  the 
method  arises  from  the  need  to  suppress  the  amplification  of 
self-annealed cDNA and genomic homoduplexes. Fig. 2
outlines the basic schema.

Part  One.  First,  both  the  large  genomic  clone  and  cDNA  are 
separately cleaved with Sau3AI and modified with T4 DNA
ligase  by  the  addition  of  oligonucleotide  adaptors  (pR-24  for 
genomic  DNA  and  pN-24  for  cDNA)  at  their  5'  ends  only. 
These  adaptors  have  phosphate  groups  at  their  5'  ends,  for 
reasons  that  will  become  apparent  in  part  two. 

Next,  the  3'  ends  of  the  adapted  cDNA  molecules  are 
partially filled-in with the three nucleotides guanosine,
adenosine, and thymidine by using the Stoffel fragment (a Taq
DNA  polymerase  lacking  both  3'  and  5'  exonuclease  activity). 
This  step  enables  us  to  distinguish  later  the  homoduplexes  of 
genomic  DNA  from  the  cDNA-genomic  heteroduplexes.  An 
alternative  to  partial  filling  is  presented  later. 

The  cDNA  and  genomic  DNAs  are  mixed  at  equal  mass 
ratios,  and  the  selection  adaptor  pR(+)SP6  is  added. 
pR(+)SP6  is  complementary  to  pR-24  but  has  an  extra 
cytidine  at  its  5'  end  and  a  sequence  complementary  to  the  SP6 
RNA  polymerase  promoter  sequence  at  its  3'  end. 
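
The relationships among the adaptors can be checked directly against the sequences in Table 1. The following is a minimal sketch, assuming the table lists every oligonucleotide 5' to 3'; the function names are illustrative and are not part of the published protocol.

```python
# Oligonucleotide sequences copied from Table 1 (assumed to be written 5' -> 3').
OLIGOS = {
    "N-12": "GATCTTCCCTCG",
    "R-12": "GATCTGCGGTGA",
    "pN-24": "AGGCAACTGTGCTATCCGAGGGAA",
    "pR-24": "AGCACTCTCCAGCCTCTCACCGCA",
    "pR(+)SP6": "CTGCGGTGAGAGGCTGGAGAGTGCTCTATAGTGTCACCTAAAT",
    "SP6": "TATTTAGGTGACACTATAGAGCA",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def three_prime_overlap(adaptor, primer):
    """Number of 3'-terminal adaptor bases that pair with the primer's 3' end."""
    rc_primer = reverse_complement(primer)
    for k in range(min(len(adaptor), len(rc_primer)), 0, -1):
        if adaptor.endswith(rc_primer[:k]):
            return k
    return 0


def describe_selection_adaptor(oligos=OLIGOS):
    """Summarize how pR(+)SP6 relates to pR-24 and to the SP6 primer."""
    sel = oligos["pR(+)SP6"]
    rc_pr24 = reverse_complement(oligos["pR-24"])
    return {
        # The single extra base at the 5' end of the selection adaptor.
        "extra_5prime_base": sel[0],
        # True if the next 24 bases are the exact complement of pR-24.
        "complements_pR24": sel[1:1 + len(rc_pr24)] == rc_pr24,
        # How many bases at the 3' end can anneal to the SP6 primer.
        "sp6_primer_overlap": three_prime_overlap(sel, oligos["SP6"]),
    }


def twelve_mer_bridges(twelve_mer, twenty_four_mer):
    """True if the 12-mer is GATC followed by the complement of the 24-mer's 3' end."""
    tail = twelve_mer[4:]
    return (twelve_mer.startswith("GATC")
            and tail == reverse_complement(twenty_four_mer[-len(tail):]))


if __name__ == "__main__":
    print(describe_selection_adaptor())
    print(twelve_mer_bridges(OLIGOS["R-12"], OLIGOS["pR-24"]))
    print(twelve_mer_bridges(OLIGOS["N-12"], OLIGOS["pN-24"]))
```

Run as a script, this prints the summary for pR(+)SP6 and confirms that each 12-mer bridges the GATC overhang and the 3' end of its partner 24-mer.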

The  mixture  is  heat-denatured  and  allowed  to  reanneal. 
During  reannealing,  two  and  three  part  structures  will  form,  as 
indicated  in  Fig.  2A.  The  reannealed  mixture  is  treated  with  a 
thermostable  DNA  ligase,  Ampligase,  at  68°C  to  ensure  that 
perfect  matches  are  preferentially  ligated. 

Ligation  of  pR(+)SP6  to  cDNA  will  occur  only  when  the 
latter  is  annealed  to  a  matching  genomic  fragment,  forming  a 
perfect  contiguous  substrate  for  the  ligase.  The  extra  cytidine 
at  the  5'  end  of  pR(+)SP6  is  needed  to  fill  the  gap  of  the 
partially  filled-in  cDNA  fragment.  pR(+)SP6  will  not  be 
ligated  to  genomic  homoduplexes  because  those  three  part 
structures  will  have  a  3-nucleotide  gap.  Neither  will  pR(+)SP6 
be  ligated  to  cDNA  homoduplex  because  the  latter,  lacking 
pR-24  and  having  only  a  1-base  overhang,  will  not  base-pair  to 
pR(+)SP6. 
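
The gap arithmetic behind this selectivity can be sketched in a few lines. This is a simplified model, assuming that only the 4-base Sau3AI overhang matters and that the polymerase stops at the first template base whose complementary dNTP is absent; the function names are hypothetical.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

SAU3AI_OVERHANG = "GATC"  # 5' overhang left by Sau3AI, written 5' -> 3'


def partial_fill_in(overhang_5p, dntps):
    """Bases added to a recessed 3' end when copying a 5' overhang.

    The polymerase reads the template 3' -> 5', so the overhang string is
    walked backwards; extension stops at the first base whose complement
    is not among the supplied dNTPs.
    """
    added = []
    for base in reversed(overhang_5p):
        nucleotide = COMPLEMENT[base]
        if nucleotide not in dntps:
            break
        added.append(nucleotide)
    return "".join(added)


def gap_before_adaptor(overhang_5p, filled, adaptor_extra_bases=1):
    """Unpaired template bases left between the 3' end and the annealed adaptor.

    adaptor_extra_bases is the number of bases the selection adaptor carries
    beyond the complement of pR-24 (one cytidine for pR(+)SP6).
    """
    return len(overhang_5p) - len(filled) - adaptor_extra_bases


# cDNA ends are filled in with dATP, dGTP, and dTTP only (no dCTP).
cdna_filled = partial_fill_in(SAU3AI_OVERHANG, {"A", "G", "T"})
# Genomic fragments are not filled in at all.
genomic_filled = ""

cdna_gap = gap_before_adaptor(SAU3AI_OVERHANG, cdna_filled)        # ligatable nick
genomic_gap = gap_before_adaptor(SAU3AI_OVERHANG, genomic_filled)  # gap blocks ligation
```

Under these assumptions the cDNA strand in a heteroduplex is left with a nick that the ligase can seal, whereas a genomic strand is separated from the adaptor by the 3-nucleotide gap described above.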

Thus  only  cDNAs  with  homology  to  genomic  fragments  at 
one  end  will  have  the  pN-24  adaptors  at  their  5'  ends  and  the 
pR(+)SP6 selection adaptor at their 3' ends.

Part Two. The selectively modified cDNA fragments are
amplified in the following manner. First, after ligation, we
denature the mixture and use Taq DNA polymerase and an
oligonucleotide primer containing the SP6 promoter sequence
to make the selectively modified cDNAs double-stranded.
These SP6 primers are not phosphorylated at their 5' ends.
This step is repeated 10 times to give an arithmetic increase in
the number of complementary strands (see Fig. 2B).
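
The difference between this arithmetic increase and the exponential increase of a two-primer PCR can be made concrete with an idealized calculation. The sketch below assumes every template is copied once per cycle (100% efficiency), which real reactions do not achieve; the function names are illustrative.

```python
def linear_copies(templates, cycles):
    """New strands made when a single primer copies the original templates each cycle."""
    return templates * cycles


def exponential_copies(templates, cycles):
    """Total strands in an idealized two-primer PCR, doubling every cycle."""
    return templates * 2 ** cycles


# Ten rounds of SP6-primed extension versus ten cycles of two-primer PCR.
sp6_primed = linear_copies(1, 10)
two_primer = exponential_copies(1, 10)
```

Ten single-primer cycles therefore yield on the order of ten complementary strands per template, compared with roughly a thousand strands from ten ideal cycles of conventional PCR.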

Next, the entire mixture is treated with λ exonuclease, which
degrades double-stranded fragments from their phosphorylated
5' ends. The strands synthesized from the SP6 primers are
thus protected. A small number of duplexes will be formed
during the reannealing of the previous step and, when filled-in
by Taq polymerase, would contaminate the subsequent reactions.
These contaminants are destroyed by λ exonuclease
because the adaptors pR-24 and pN-24 have phosphorylated 5'
ends.

Fig. 2. RICH procedure. (A) Part 1. (B) Part 2. A circle represents
a phosphorylated 5' end. Open boxes represent filled-in bases.
Oligonucleotides and their antisense sequences are drawn as dark and
light shaded boxes.

Genetics: Hamaguchi et al. Proc. Natl. Acad. Sci. USA 95 (1998)

To  amplify  the  SP6-primed  strand,  we  perform  a  PCR  with 
Pfu  DNA  polymerase  and  SP6  and  pN-24  oligonucleotides  as 
primers.  We  use  Pfu  polymerase  because  its  products  are 
blunt-ended,  whereas  Taq  polymerase  sometimes  adds  an  extra 
nucleotide  to  the  3'  end  of  its  product,  and  the  SP6  RNA 
polymerase  used  in  the  next  step  does  not  work  well  on 
substrates  with  3'  protruding  ends. 

In this amplification step, it is likely that cDNA homoduplexes
that escaped λ exonuclease treatment will be amplified.
Moreover,  a  significant  amount  of  genomic  homoduplexes 
would  be  amplified  in  later  steps.  Therefore,  we  use  SP6  RNA 
polymerase  (Ambion)  to  create  RNA  transcripts  of  those 
molecules  that  contain  the  SP6  promoter  sequence  and  digest 
any  surviving  DNA  molecules  with  DNase  I. 

Finally,  reverse  transcription  with  avian  myeloblastosis  virus 
reverse  transcriptase  (Invitrogen)  using  pN-24  as  primer, 
followed  by  PCR  with  Taq  DNA  polymerase  and  pN-24  and 
pR-24  as  primers,  yields  products  for  cloning  and  analysis. 

Testing  the  c-MYC  Locus.  Our  first  mock  experiments,  not 
shown,  used  serial  dilutions  of  the  bacterial  plasmid  pUC18 
into  a  cDNA  library,  and  DNA  from  a  160-kbp  BAC  (60C5) 
to  which  was  added  an  equimolar  amount  of  the  same  plasmid. 
These  experiments  demonstrated  to  our  satisfaction  that  the 
procedure would work well with cDNA species constituting as
little as 0.01% of a cDNA library. Success of the method was
not  evident  when  the  pUC18  was  present  in  cDNA  at  the 
0.0001%  level.  We  proceeded  next  with  actual  test  cases. 

In our first test case, we used an 80-kb P1 bacterial cloning
vector containing the entire c-MYC gene and used randomly
primed double-stranded cDNA that was prepared from the
poly(A)+ RNA of the breast cancer cell line SKBr3, in which the
c-MYC locus is amplified. Northern blot analysis indicated that
the level of c-MYC expression from SKBr3 was in the middle of
the range for a panel of breast cancer cell lines, including BT20,
Du4475, HS578T, MDA134, MDA231, MDA436, SKBr3,
UACC812, UACC893, ZR75-1, and ZR75-30. We estimated
that c-MYC expression was 3-fold higher in SKBr3 than in
normal breast tissue and 3-fold lower than in the highest
expressing tumor cell line tested, ZR75-30.

Clones  were  prepared  from  the  RICH  products  by  either 
blunt-ended  ligation  to  pCR-Script  SK(+)  or  cleavage  with 
Sau3AI followed by ligation to pUC18. Vector inserts were
sequenced.  All  blunt-ended  inserts  contained  both  the  pN-24  and 
pR-24  sequences  and  the  adjoining  GATC  sequences,  indicating 
that  they  all  derived  from  cDNA-genomic  heteroduplexes. 

Fig. 3. RICH products and the c-MYC gene. Arrows indicate
Sau3AI recognition sites. Shadowed boxes represent RICH products,
the sizes of which are 150, 250, 166, and 135 bp from the left. The large
restriction fragment that we failed to isolate lies between the third and
fourth arrows from the left and is 880 bp.

cDNA insert sizes varied from 61 to 259 bp. Roughly
one-quarter of clones either had no matches in the databases
or were repeat sequences, mainly of the Alu family. The
remaining three-quarters had identity to sequences found on
c-MYC mRNA, as indicated in Fig. 3. One type of RICH
product derived from exon 1 of c-MYC entirely, one spanned
exons  2  and  3,  and  two  derived  entirely  from  exon  3.  Shorter 
fragments,  less  than  100  bp,  and  a  larger  fragment,  880  bp 
spanning  exon  1  and  2,  were  not  obtained  in  the  first  81  clones 
examined.  No  clones  from  either  the  3'  or  5'  ends  of  c-MYC 
were  obtained,  and  this  is  predicted  by  theory  because  a  cDNA 
fragment  would  need  to  have  Sau3Al  sites  at  both  ends  to  be 
found. 
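The constraint that a recoverable cDNA fragment must be flanked by Sau3AI sites on both sides can be illustrated with a small in-silico digest. This is only a sketch: the function name and the toy sequence are hypothetical (the sequence is not c-MYC), and the only biological fact assumed is that Sau3AI recognizes GATC and cuts immediately 5' of the G.

```python
def internal_fragments(seq, site="GATC"):
    """Return (start, length) for every fragment bounded by `site` on both sides.

    Terminal pieces (upstream of the first site, downstream of the last)
    are deliberately excluded, mirroring the loss of 5' and 3' cDNA ends.
    """
    seq = seq.upper()
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos)  # cut falls immediately 5' of the G in GATC
        pos = seq.find(site, pos + 1)
    # Each consecutive pair of cut positions delimits one internal fragment.
    return [(a, b - a) for a, b in zip(cuts, cuts[1:])]


# Hypothetical toy sequence with two sites: one internal fragment is expected.
toy = "AAAC" + "GATC" + "TTTTTTTT" + "GATC" + "CCCA"
print(internal_fragments(toy))
```

A sequence with fewer than two sites yields no internal fragment at all, which is the in-silico counterpart of a transcript that the method cannot recover with this enzyme.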

Testing the PTEN Locus. In a second test of the method, we
used a 160-kb BAC, 60C5, containing PTEN, a tumor
suppressor gene that encodes a mixed-specificity protein
phosphatase expressed ubiquitously but at a low level in normal
tissues (11-13). We prepared double-stranded cDNA from
normal human breast tissue by random priming of poly(A)+
RNA. As a control we used another BAC from a different
region of the genome. By following the same protocol as above,
we obtained RICH products from each BAC and probed the
products with PTEN cDNA by Southern blotting. Only RICH
products from the PTEN BAC yielded fragments that
hybridized with PTEN sequences (Fig. 4). In total at least three
PTEN products were observed, about 200, 350, and 570 bp
long. We expected fragment sizes of 153, 298, and 520 bp
without adaptors and 201, 346, and 568 bp with adaptors, all
of which span exons. We failed to observe a range of smaller
fragments (Fig. 5 and see Discussion).
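The expected sizes with adaptors follow from simple arithmetic, sketched below. The sketch assumes that each product carries two 24-nucleotide adaptors (pN-24 at one end and pR-24 at the other), an assumption based on the adaptor names; the function name is illustrative.

```python
ADAPTOR_LEN = 24  # assumed length of pN-24 and of pR-24, in nucleotides


def size_with_adaptors(fragment_bp, n_adaptors=2, adaptor_len=ADAPTOR_LEN):
    """Length of a RICH product once both adaptors are attached."""
    return fragment_bp + n_adaptors * adaptor_len


# Sau3AI fragment sizes quoted in the text for PTEN.
expected = {bp: size_with_adaptors(bp) for bp in (153, 298, 520)}
for bare, tagged in expected.items():
    print(f"{bare} bp fragment -> {tagged} bp with adaptors")
```

Under that assumption the computed values match the 201, 346, and 568 bp quoted above, and they agree with the observed bands of about 200, 350, and 570 bp.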

To estimate the ratio of the products that derived from
PTEN cDNA to products deriving from repeat sequences, a
collection of clones was analyzed by filter hybridization with
both PTEN cDNA and total human DNA as probes. Six of 120
clones hybridized to PTEN cDNA and about 60 hybridized to
total human genomic sequences.

To extend the utility of RICH, we tested a variation in the
method: we used the PCR product from a large-insert cloning
vector as our source of genomic DNA. For this purpose, we
gel-purified PTEN BAC and cleaved it with Sau3AI, ligated
pR-24 to the 5' ends, filled in the 3' ends, and used PCR to
amplify fragments with pR-24 as primer. We used this
amplified material in an otherwise identical protocol and obtained
results that, by Southern blotting, appeared as satisfactory as
when we began with restriction endonuclease-cleaved BAC
DNA.

We analyzed in greater depth the RICH products of this
experiment by blunt-ended cloning into pCR-Script SK(+).
Ninety-six individual clones were analyzed. They were tested for
the presence of inserts by PCR using primers derived from the
vector cloning sites and for the presence of the Sau3AI sites by
cleavage of the PCR product. By this simple test, only 17 had
Sau3AI sites at the expected sites within the inserts. Sequence
analysis indicated that all 17 of these contained both the pN-24
and pR-24 primers at the proper sites (this could also have been
determined by PCR). Of the 17, five derived from PTEN cDNA,
representing the 153-bp (three times) and the 298-bp (twice)
Sau3AI fragments. The larger fragment, 520 bp long, was not
obtained in this group. The remaining 12 inserts contained a
Sau3AI site within human repeat sequences. Two of these
contained repeats we have not yet found in the sequence of the
PTEN BAC, the sequence of which is now 90% complete.

Fig. 4. RICH products from the PTEN gene. Lanes: A, RICH
products obtained from the PTEN BAC, 60C5; B, RICH products
from an unrelated BAC; C, pBR322 DNA digested with MspI. Lanes
A and B were transferred to a nylon membrane and probed with PTEN
cDNA (lanes D and E) or total human DNA (lanes F and G). Lanes:
D and F, RICH products from the PTEN BAC; E and G, RICH
products from the unrelated BAC.

Fig. 5. RICH products and the PTEN gene. Arrows indicate
Sau3AI recognition sites. The shadowed boxes represent RICH
products, the sizes of which are 298, 520, and 153 bp from the left. The
restriction fragments in exon 8 and exon 9 that we failed to isolate are
81, 33, 27, and 29 bp from the left.

DISCUSSION 

Finding genes in large chromosomal regions has been
approached in three ways: exon trapping, DNA sequencing
analysis, and direct hybridization selection. Exon trapping
works when the gene in question contains splicing sites that are
efficiently recognized by the host cell (6-8). But it fails when
introns are absent or the intron-exon borders are not
recognized, and exon trapping yields a background of false
candidates derived from cryptic splice sites. DNA sequencing is
effective when the gene in question has homology to a known
expressed sequence. Even without homology, computational
methods for predicting genes also have promise (4). But DNA
sequencing on a massive scale is still costly. Direct
hybridization selection (1) has also found use, but it diminishes in
usefulness with rare messages and suffers from the vagaries of
physical selection methods and background problems with
repetitive sequences. We have described an additional
approach to this problem: an effective protocol for selecting
cDNA fragments that are homologous at one of their ends to
one of the ends from a collection of genomic fragments. This
method should work whenever a cDNA population is available
that contains transcripts from the gene in question.

A protocol similar to ours (end ligation coincident sequence
cloning, EL-CSC) has been presented by Brookes et al. (14).
Like ours, their protocol is based upon heteroduplex formation
between DNA fragments made from two populations, and the
use of ligation (with what they call "capture oligonucleotides")
to distinguish heteroduplex from homoduplex. Unlike our
procedure, their method requires heteroduplex formation at
both ends between cDNA and genomic fragments, because
both ends of the heteroduplex must be ligated. Thus, cloning
of cDNA fragments that span introns is much less likely. In our
procedure, we use a selection adaptor that allows us to
generate an RNA intermediate, and so we can isolate
heteroduplexes that have formed at only one end. Moreover, EL-CSC
uses physical trapping through biotin-avidin complex
formation to enrich for products. We have experienced difficulty
with protocols incorporating such methods and have avoided
them in RICH. The report of Brookes et al. (14) does not
contain sufficient information to enable us to make
quantitative comparisons of our methods, nor have we found their
procedure used in the published literature.


Although the yield of c-MYC fragments in the RICH
products was very satisfactory, the yield of PTEN products was
less so. An additional prescreening of RICH products from
PTEN was needed: namely, the verification of the adaptors and
the Sau3AI sites that should be present upon proper priming
and ligation. We believe that this is due to the reduced level of
expression of the PTEN gene compared with c-MYC.
Moreover, an analysis of our products from the PTEN BAC revealed
a higher proportion of repeat-containing sequences than were
found for c-MYC, presumably for the same reason.

Not all fragments from the same cDNA will have the same
yield. For example, the 880-bp fragment from c-MYC was not
obtained as a RICH product. We speculate this may be due to
the difficulty of obtaining long transcripts from the SP6
polymerase. Also, we did not clone the 520-bp fragment of
PTEN that was seen upon Southern blotting to be present in
reduced amount. Most strikingly, we do not observe very short
fragments in the RICH products. Possibly, this deficit is
partially due to the slower kinetics of hybridization of shorter
fragments (15). Gel fractionation before cloning might
overcome some of these problems of underrepresentation.
Alternatively, these limitations can be overcome by using different
restriction endonucleases during the protocol.

We have introduced one variation in our method: amplifying
genomic DNA fragments before use. This step was
incorporated for two reasons. (i) If genomic fragments are amplified
and not cleaved, it should be unnecessary to partially fill in the
ends of restriction endonuclease-cleaved cDNA, because the
selection adaptor cannot be ligated to PCR-amplified genomic
DNA. Thus, with this variation, any restriction endonuclease
can be chosen, if used both for cDNA cleavage and
amplification of genomic fragments, overcoming limitations in the
discovery of cDNAs that might result from the use of Sau3AI
discussed above. (ii) With genomic amplification the user can
initiate the search for transcripts with very small amounts of
genomic DNA. In fact, we have performed RICH starting
from small amounts of gel-purified BAC DNA. It may be
possible to extend this method to gel-purified yeast artificial
chromosome DNAs.

We cannot obtain the 3' or 5' ends of cDNA transcripts with
RICH because only cDNA fragments with restriction
endonuclease cleavage sites at both ends can be selectively
amplified. However, RICH can be used in reverse (rapid isolation
of genes by hybridization to transcripts, RIGHT) to identify
genomic clones with homology to cDNAs. The directional
modifications of genomic fragments and cDNAs used in RICH
can be essentially reversed, rendering genomic fragments that
form heteroduplex with cDNA the only selectable RIGHT
products. When unamplified cDNAs are used, the RIGHT
protocol should yield the 3' and 5' ends of a transcription unit.
In addition, RIGHT might facilitate the confirmation of
cDNAs found by RICH and aid in the determination of
intron-exon boundaries.

ABSTRACT Analysis of the genetic changes in human
tumors is often problematical because of the presence of
normal stroma and the limited availability of pure tumor
DNA. However, large amounts of highly reproducible
"representations" of tumor and normal genomes can be made by
PCR from nanogram amounts of restriction endonuclease-cleaved
DNA that has been ligated to oligonucleotide adaptors.
We show here that representations are useful for many types
of genetic analyses, including measuring relative gene copy
number, loss of heterozygosity, and comparative genomic
hybridization. Representations may be prepared even from
sorted nuclei from fixed and archived tumor biopsies.


Analysis of the genetic changes in human tumors is often
problematical because of the presence of normal stroma.
Although either microdissection or flow cytometry can
produce small samples highly enriched for tumor cells or nuclei,
the extracted DNA is of insufficient quantity for most uses.
Nevertheless, we have successfully performed a complex
protocol, representational difference analysis (RDA), on such
small samples. RDA is a subtractive DNA hybridization
technique that discovers the differences between paired normal
and tumor genomes (1). The first step of RDA is the
preparation of "representations," which are highly reproducible
reformattings and amplifications of DNA populations.
Typically, a representation is a set of restriction endonuclease
fragments of a limited size range amplified by PCR. As much
as 100 µg of DNA can be prepared from as little as 3 ng of
DNA (≈1 × 10³ cells).

In RDA, a representation with much lower complexity than
the starting population is needed to enable a subtractive
hybridization step to proceed effectively. Such low complexity
representations (LCRs) do not "capture" enough (typically,
≈7%) of the genome to be useful for many of the more
common types of analyses. However, we demonstrate here that
high complexity representations (HCRs) can provide ample
amounts of DNA in a sufficiently reproducible manner
suitable for most conventional studies. We demonstrate one type
of HCR that captures ~70% of the genome and illustrate its
use for determining gene copy number, deletion mapping, loss
of heterozygosity (LOH), and comparative genomic
hybridization (CGH). HCRs may be a generally useful means of
"immortalizing" and archiving DNA for later analysis from
nonrenewable sources.

MATERIALS  AND  METHODS 

Materials. Restriction endonucleases as well as T4 DNA
ligase, T4 DNA polymerase, and T4 polynucleotide kinase
were supplied by New England Biolabs. AmpliTaq was
supplied by Perkin-Elmer. Oligonucleotide adaptors RBgl-12 and
RBgl-24 [as used in Lisitsyn et al. (1)], oligonucleotides used
for PCR of World-Wide Web probes, and tetranucleotide
repeat D17S695 were synthesized by BioSynthesis (Lewisville,
TX). Genescreen Plus was purchased from DuPont.
Radioactive nucleotides, rediprime labeling kit, dNTPs, and Hyperfilm
were purchased from Amersham Pharmacia Biotech (Uppsala,
Sweden). Cell lines were obtained through the American
Type Culture Collection and grown in culture, and DNA was
prepared from them. Normal human placenta DNA was obtained from
CLONTECH. Oligonucleotides and probes for use with the
Applied Biosystems 7700 Sequence Detector were synthesized
by Applied Biosystems. Nusieve agarose gels used for analysis
of PCR products were purchased from FMC. World-Wide
Web probe sequences were downloaded from the
Massachusetts Institute of Technology human genome sequencing
database (http://www-genome.wi.mit.edu/).

Production of Representations. Genomic DNA (3-10 ng)
was digested by the desired restriction endonucleases (DpnII
and BglII) under conditions suggested by the supplier. The
digest was purified by phenol extraction and then precipitated
in the presence of 10 µg of tRNA. The digested DNA was
resuspended by the addition of 444 pmol of each adaptor
(RBgl24 and RBgl12), T4 DNA ligase buffer (diluted to 1×,
provided with enzyme), and water to bring the volume to 30 µl.
The reaction was placed in a 55°C heat block, and the
temperature was decreased slowly to 15°C by placing the heat
block at 4°C (for ~1 hr). On reaching 15°C, the RBgl24
adaptor was ligated by the addition of 400 units of T4 DNA
ligase and by incubation at 15°C for 12-18 hr. The ligated
material was divided into two tubes, and the following was
added: 80 µl of 5× PCR buffer [335 mM Tris-HCl, pH 8.8/20
mM MgCl2/80 mM (NH4)2SO4/50 mM β-mercaptoethanol/
0.5 mg/ml of BSA], 2'-deoxynucleoside 5'-triphosphates to a
final concentration of 0.32 mM, RBgl24 adaptor to a final
concentration of 0.6 µM, and H2O to bring the volume to 400
µl. Each reaction was overlaid with 100 µl of mineral oil. The
reaction was placed in a thermal cycler preheated at 72°C, and
15 units of AmpliTaq was added to the tubes. The thermal
cycler was set to continue at 72°C for 5 min to allow for filling
in the 3' ends of the ligated molecules. This step was followed
by 20 cycles of 1 min at 95°C and 3 min at 72°C, with an
additional extension of 10 min at 72°C after the last cycle. The
PCR was divided into two tubes (now a total of four tubes), and
the following reagents were added: 40 µl of 5× PCR buffer (as
above), dNTP to a final concentration of 0.32 mM, RBgl24
adaptor to a final concentration of 0.6 µM, water to bring the
volume to 400 µl, and 100 µl of mineral oil. The tubes then
were amplified for an additional five cycles with extension
according to the above conditions. The reactions were purified
by phenol-chloroform extraction and then precipitated by the
addition of 1/10th the reaction volume of sodium acetate (3 M,
pH 5.2) and one reaction volume of isopropanol.
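The buffer volumes in the two amplification stages can be checked with a short dilution calculation. The sketch below assumes that each 400-µl first-stage reaction is split into two equal 200-µl halves before the second stage and ignores the small contribution of the ligation mix; the function name is illustrative.

```python
def final_conc(parts, final_volume_ul):
    """Concentration after mixing.

    parts: iterable of (volume_ul, concentration) pairs, with the
    concentration expressed in multiples of the working strength (x).
    """
    return sum(volume * conc for volume, conc in parts) / final_volume_ul


# Stage 1: 80 µl of 5x PCR buffer brought to 400 µl.
stage1 = final_conc([(80, 5.0)], 400)

# Stage 2 (assumed even split): 200 µl of the stage-1 reaction,
# plus 40 µl of fresh 5x buffer, brought to 400 µl.
stage2 = final_conc([(200, stage1), (40, 5.0)], 400)

print(f"stage 1: {stage1:.2f}x, stage 2: {stage2:.2f}x")
```

Under these assumptions both stages end at the 1× working strength, which is consistent with the halved buffer volume specified for the second stage.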

PTEN Tumor Suppressor Locus Genomic Sequencing.
Bacterial artificial chromosome DNA was purified by detergent
lysis and polyethylene glycol precipitation. DNA (5 µg) was
sheared and then repaired by using T4 DNA polymerase.
Fragments in the range of 1.5-2 kb were isolated by gel
fractionation and inserted into M13 by ligation. Sequencing
reactions were done by using dye primer chemistry that uses
energy transfer primers (2) and thermosequenase polymerase
(3). Reactions were run on an Applied Biosystems 377
sequencer for 6- to 8-hr sequence runs for analysis. Base calling
was carried out by using the program phred (P. Green,
University of Washington). The collection of initial data was
assembled into a contig by using the automated assembly
program phrap (P. Green). The database generated was
converted to XGAP format and edited by using XGAP (4, 5).
After completion of ~6-fold coverage, the finishing process
was carried out by using the program finish (G. Marth,
Genome Sequencing Center at Washington University) to fill
gaps in the contig.

Southern Blotting. Normal DNA from a tumor cell line or
representations derived from sorted nuclei were digested with
either DpnII or BglII. Digested genomic DNA (5.0 µg),
digested HCR (5.0 µg), or digested LCR (0.5 µg) was loaded
on a 1.5% agarose gel. The gel was transferred to Genescreen
Plus after electrophoresis was completed. After transfer, the
blot was hybridized in GIBCO prehybridization solution at
67°C for 2 hr. The probe was produced by random priming with
the rediprime kit in the presence of [α-32P]dCTP. After 12-18 hr
of hybridization, the blots were washed with standard saline
phosphate/EDTA buffer as described (6). The blots were used
for producing autoradiograms and/or phosphorimages.

LOH Analysis. LOH was assessed by using the
tetranucleotide repeat marker D17S695 (listed as UT269 in the National
Center for Biotechnology Information Web Site
http://www.ncbi.nlm.nih.gov/Entrez/nucleotide.html), which maps
to the 17p13 region of the human genome and has been used
to determine the state of LOH at the p53 locus (7, 8). The
reverse primer was labeled by phosphorylation in the presence
of [γ-32P]ATP as described (7, 8). The labeled reverse primer in
combination with both forward and reverse primers was used
for PCR by using 50 ng of HCR as DNA template. PCRs were
performed as described in the National Center for
Biotechnology Information Web Site, and gel electrophoresis was
performed as described (7, 8).


CGH. CGH was performed according to standard
procedure (9) by using HCR DNAs and genomic DNAs prepared
from BT474 and MCF7 as target and normal female human
lymphocyte as the reference. The genomic and HCR DNA
samples were labeled by nick-translation with fluorescein
isothiocyanate-12-dUTP and Texas Red 5-dUTP to produce
DNA fragments ranging in size between 500 and 2,000 bp.

Quantitative PCR. Primer and probe sequences were
produced according to the Applied Biosystems software PRIMER
EXPRESS. These primers and probes were used in a PCR that
was placed in the Applied Biosystems 7700 Sequence Detector
and amplified as described (10, 11). Samples used were HCRs
produced from DNA derived from nuclei of sorted primary
tumor biopsies. After the PCR, the data of fluorescence
corresponding to cycle number were downloaded from the
Applied Biosystems 7700 Sequence Detector to Microsoft
EXCEL and analyzed to give the graphs shown later.

RESULTS 

The Basic Method. DNA samples, usually from paired tumor
cells or nuclei and normal cells or nuclei, were processed in
parallel to prepare representations. DNA was cleaved with a
restriction endonuclease, such as DpnII or BglII, that is not
blocked by 5-methylcytosine, and double-stranded, cohesive,
adaptor oligonucleotides were ligated to the fragment ends.
The adaptors were not phosphorylated and therefore could not
be self-ligated to form interfering dimers. Because adaptors
can be ligated only to the 5' ends of the cleavage fragments, the
ligated product then was treated with DNA polymerase to fill
in the 3' ends, forming the primer binding site for the
subsequent PCR. Amplification was performed in stages, each
stage using the adaptor oligonucleotides as primers and
proceeding for no more than 20 rounds per stage. PCR does not
amplify all fragments equally well, and high molecular weight
fragments in particular are very poorly amplified. Thus,
restriction endonucleases that cleave infrequently were used to
prepare LCRs, and enzymes that cleave frequently were used
to prepare HCRs (see Fig. 1). From 3 ng of starting material,
we could obtain 40 µg of total HCR from a total of 25 rounds.
We have not tested the usefulness of an HCR beyond a total
of 35 rounds.
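The stated yield gives a rough idea of the average amplification per round. The sketch below takes the yield to be 40 µg (micrograms, our reading of the unit) from 3 ng of input over 25 rounds and assumes, purely for illustration, a constant per-round factor; real reactions slow down as primers and nucleotides are consumed, so this is an average and not a measured efficiency.

```python
def per_cycle_factor(start_ng, yield_ng, cycles):
    """Geometric-mean amplification factor per PCR round."""
    return (yield_ng / start_ng) ** (1.0 / cycles)


start_ng = 3.0          # input DNA, ng
yield_ng = 40_000.0     # 40 µg expressed in ng (assumed reading of the unit)
cycles = 25

fold = yield_ng / start_ng
factor = per_cycle_factor(start_ng, yield_ng, cycles)
print(f"overall: {fold:,.0f}-fold; average per round: {factor:.2f}x")

# For comparison, perfect doubling over the same number of rounds.
ideal_fold = 2.0 ** cycles
```

The overall amplification is on the order of 10^4-fold, far below the roughly 3 × 10^7-fold that perfect doubling over 25 rounds would give, which is consistent with the observation that not all fragments amplify equally well.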

Sampling the Complexity and Reproducibility of DpnII
Representations. We first sampled the reproducibility and
complexity of DpnII HCRs. We analyzed 14 different HCRs,
each made from 5 ng of DNA prepared from diploid nuclei
separated from tumor biopsies by flow cytometry and each
amplified by PCR for 25 rounds. In our first sampling, we
designed pairs of PCR primers to detect Web Site
sequence-tagged sites. We picked sequence-tagged sites that were not
cleaved by DpnII and used primer pairs that amplified a single
band from total genomic DNA controls. Of these, 18 of 25
pairs (72%) were able to amplify the same molecular weight
fragment from each HCR, and 7 generally failed to amplify
from any HCR. Our results suggest that DpnII HCRs
reproducibly contain the same elements and ≈70% of the genome.

Fig. 1. Schematic comparison of LCR and HCR, illustrating the
complexity reduction that occurs after cleavage with rare and
frequent cutters, respectively.

We performed a similar sampling with primer pairs derived
from the locus encoding the PTEN tumor suppressor gene, for
which we had the complete nucleotide sequence (unpublished
data). In this way, we were able to use primers derived from
DpnII fragments of known size. DpnII fragments were chosen
at random, and PCR primer pairs were designed for each.
Twenty-two primer pairs amplified single fragments by PCR
from control genomic DNAs. These pairs were used with the
same panel of 14 HCRs used above. Twenty pairs
amplified the expected fragment from all HCRs, and 2 pairs
failed to amplify from any. The fragments that were not in the
HCRs were the largest, 3,916 bp, and one of the smallest, 97
bp. Totaling all fragment lengths, 16,039 bp were included in
the HCRs, and 4,013 bp were excluded. Thus, assuming that
our initial selection of DpnII fragments was random, the HCRs
contained ≈75% of the PTEN region.
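The sampling figures above can be recomputed directly from the counts given in the text. The sketch below uses only those numbers; the variable names are illustrative.

```python
# Sequence-tagged-site sampling: primer pairs that amplified from every HCR.
sts_ok, sts_total = 18, 25

# PTEN-locus sampling: primer pairs and total fragment lengths.
pten_ok, pten_total = 20, 22
bp_included, bp_excluded = 16_039, 4_013
missing_fragments_bp = (3_916, 97)  # the two fragments absent from the HCRs

sts_fraction = sts_ok / sts_total
pten_pair_fraction = pten_ok / pten_total
pten_bp_fraction = bp_included / (bp_included + bp_excluded)

print(f"STS pairs amplified:        {sts_fraction:.0%}")
print(f"PTEN pairs amplified:       {pten_pair_fraction:.0%}")
print(f"PTEN region (by length):    {pten_bp_fraction:.0%}")
print(f"missing fragments total:    {sum(missing_fragments_bp)} bp")
```

The two missing fragments account exactly for the excluded length. Weighted by length, the included fraction works out to about 80%, in the same range as the approximate figure quoted in the text and the roughly 70% estimated from the sequence-tagged sites.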

If DpnII cleavage is nearly complete during the preparation
of an HCR, we expect that no PCR primer pairs should readily
amplify from an HCR when the amplified sequence has an
internal DpnII site. To test this, we chose four primer pairs
from the PTEN locus that amplified a single fragment
containing a single internal DpnII site. All four pairs amplified
fragments from genomic DNA controls, and none amplified
detectable fragments from the 14 HCRs. We conclude that
HCRs prepared in parallel from samples processed in parallel
are reasonably reproducible and represent ≈70% of the
human genome.

Measuring Gene Copy Number in HCRs. Tumor genomes
often contain either extra copies of sequences caused by gene
amplification or missing sequences caused by gene deletion. To
explore the usefulness of representations for measuring gene
copy number, we first compared Southern blots of genomic
DNA to blots of HCRs and LCRs. For this purpose, we used
human placental DNA as normal and prepared genomic DNA
from tumor cell lines amplified at c-erbB2 (BT-474) (12) or
c-myc (SK-BR-3) (13). HCRs and LCRs were made from cell
line or placental DNAs by using DpnII or BglII, respectively.
As probes we used small BglII fragments that we cloned from
P1s containing inserts from the designated loci. The blots,
shown in Fig. 2A, were quantitated by phosphorimaging. To
normalize for loading differences, the blots were stripped and
rehybridized with a single copy sequence probe. The
normalized ratios of signal from tumor and normal are tabulated in
Fig. 2B. The same relative copy number (tumor to normal) was
determined from blots of representations as was determined
from blots of genomic DNAs, indicating that there is a
quantitatively reproducible amplification of these test
sequences during the preparation of either HCRs or LCRs
prepared in parallel from similar starting materials. Similar
results were obtained for the cyclin D locus (data not shown).
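The normalization described above amounts to dividing the locus signal by the single-copy control signal in each lane before taking the tumor-to-normal ratio. The sketch below illustrates that calculation; the function name is illustrative and all intensity values are hypothetical placeholders, not data from the blots.

```python
def fold_amplification(tumor_locus, normal_locus, tumor_ctrl, normal_ctrl):
    """Tumor/normal signal ratio at a locus, corrected for loading.

    Each locus signal is first divided by the signal of a single-copy
    control probe measured in the same lane after stripping and reprobing.
    """
    tumor_norm = tumor_locus / tumor_ctrl
    normal_norm = normal_locus / normal_ctrl
    return tumor_norm / normal_norm


# Hypothetical phosphorimager counts (arbitrary units) for one locus,
# measured on genomic DNA and on a representation of the same samples.
genomic_fold = fold_amplification(8000, 1000, 1200, 1500)
rep_fold = fold_amplification(4000, 400, 900, 900)

print(f"genomic DNA: {genomic_fold:.1f}-fold, representation: {rep_fold:.1f}-fold")
```

With these made-up inputs both sample types give the same fold amplification even though the raw intensities and loading differ, which is the kind of agreement between genomic DNA and representations that the comparison in Fig. 2B is meant to test.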

To explore the usefulness of HCRs for deletion mapping, we
blotted both genomic and HCR DNAs from tumor cell lines
for deletion at the 20p11 locus. This locus was discovered
initially by using RDA and subsequently was found to be
deleted frequently in gastrointestinal cancers (R.L.,
unpublished data). Fig. 2C illustrates that the probe hybridized to
sequences in the HCRs when and only when it hybridized to
sequences in the respective genomic DNA. An unrelated probe
detects sequences present in all genomic samples and their
representations.

Fig. 2. Analysis of copy number using representations (Rep) and
genomic DNA (Gen) for c-erbB2 and c-myc. (A) Southern blot
comparing tumor cell lines (T) to normal (N) cell lines. DpnII was used
to prepare HCRs (DpnII), and BglII was used to prepare LCRs (BglII).
As a reference, free probe also was run alongside the samples (probe).
The hybridization probes were derived from small BglII fragments
isolated from P1 clones specific for each locus respectively. (B)
Quantitation of the Southern blot. Rep, representation; Gen, genomic
DNA; DpnII, either DpnII HCR or DpnII digest of genomic DNA;
BglII, either BglII LCR or BglII digest of genomic DNA. Blots in A
were stripped and reprobed with a single copy probe to analyze loading
differences (data not shown). Phosphorimage analysis of this probe
was used to normalize the amplification differences. After this
analysis, the resulting intensity of tumor was divided by the intensity of
normal to give the fold amplification displayed. (C) Deletion mapping
using HCRs and the deletion mapping of seven tumor cell lines
comparing blots of HCRs (HCR) with blots of the corresponding
genomic DNA (Genomic), each designated 1-7. The probe used for
hybridization is from the human genomic region 20p11 (HCR). The
Genomic panel shows the same DNAs blotted with an unrelated probe.

We tested the value of the HCRs made from limited
amounts of DNA for quantitation of copy number. HCRs
prepared from aneuploid and diploid nuclei sorted from
several breast cancer biopsies were blotted for c-erbB2. Fig. 3A
illustrates that c-erbB2 is amplified in the HCRs made from the
aneuploid nuclei of some biopsy samples. Fig. 3B shows the
blots with an unrelated probe. We obtained confirmation of
the validity of the c-erbB2 amplifications by demonstrating
that probes adjacent to but distinct from the c-erbB2 probe also
were amplified in the same samples (M.N., unpublished
results).

Fig. 3. Comparison of HCRs from primary tumor biopsies by
Southern blotting. (A) HCRs from the designated tumor biopsies
(denoted by an identification number preceded by BBR) were
prepared from diploid (Dpl) and aneuploid (Anu) nuclei and compared
by Southern blot analysis. The c-erbB2 probe was the same as that used
in Fig. 2. (B) The same DNAs blotted with an unrelated probe.

Finally, we tested some of the same samples by quantitative
PCR. For this purpose, we used the dual-labeled fluorogenic
hybridization probes and the Applied Biosystems 7700 Sequence
Detector (10, 11) to compare HCR DNAs prepared
from aneuploid and diploid nuclei pairs. The results, shown in
Fig. 4, indicate that differences in copy number were detected
by probes for the c-erbB2 oncogene and the p16 tumor
suppressor. On the other hand, no differences in copy number
were detected by probes from an uninvolved region on chromosome
3. The curve of amplification of the c-erbB2 fragment
arises four cycles sooner in one aneuploid HCR than it does in
the paired diploid HCR, indicating a higher copy number for
c-erbB2 (≈16-fold higher in the aneuploid HCR sample). The
curve for the aneuploid HCR deleted for p16 arises three
cycles later than the paired diploid HCR (approximately
one-eighth as much in the aneuploid HCR), probably reflecting
≈10% contamination of the aneuploid nuclei with diploid
nuclei after sorting. These results were confirmed by Southern
blotting (data not shown). One tumor/normal pair showed a
shift of a single cycle for primer pairs detecting the p16 gene,
which might reflect loss of a single allele in the tumor or
experimental error.
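
The cycle shifts quoted above translate into copy-number ratios if one assumes that the amount of product doubles with every cycle. That assumption is an idealization and is not something measured in the text. A minimal Python sketch under that assumption follows; the function name and the `efficiency` parameter are illustrative only.

```python
def ratio_from_cycle_shift(cycles_earlier: float, efficiency: float = 2.0) -> float:
    """Template ratio (sample / reference) implied by a shift in the curve.

    cycles_earlier is positive when the sample curve rises before the
    reference curve and negative when it rises later. efficiency is the
    assumed fold increase in product per cycle (2.0 = perfect doubling).
    """
    return efficiency ** cycles_earlier


# c-erbB2: the aneuploid curve rises four cycles sooner than the diploid curve.
erbb2_ratio = ratio_from_cycle_shift(4)        # 16.0

# p16: the aneuploid curve rises three cycles later than the diploid curve.
p16_ratio = ratio_from_cycle_shift(-3)         # 0.125, i.e. one-eighth

# A single-cycle delay corresponds to half as much template.
one_cycle_ratio = ratio_from_cycle_shift(-1)   # 0.5

print(erbb2_ratio, p16_ratio, one_cycle_ratio)
```

Under perfect doubling, a residual signal of one-eighth in a homozygously deleted sample would correspond to roughly 12% carry-over of diploid template, in line with the approximately 10% contamination suggested in the text. A one-cycle shift corresponds to a two-fold difference, which is what loss of one of two alleles would produce.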

Detection  of  LOH  in  HCRs.  LOH  is  a  common  lesion  found 
in  cancer  cells  and  may  be  indicative  of  genomic  instability 
and/or  the  loss  of  function  of  a  specific  tumor  suppressor  gene 
(14-17).  The  detection  of  LOH  often  is  obscured  by  the 
presence  of  normal  stroma;  hence,  we  tested  whether  HCRs 
prepared  from  minute  amounts  of  samples  highly  enriched  for 
tumor nuclei could be used for LOH analysis. PCR primers
that amplify microsatellites and detect fragment length polymorphisms
are used frequently for LOH mapping, and we
chose to examine a primer pair that amplifies a highly polymorphic
tetranucleotide repeat near the p53 locus (7, 8).

In  preliminary  experiments,  we  established  that  these  PCR 
primers  detected  the  same  allele  pattern  in  both  genomic  and 
HCR  DNAs  prepared  from  cell  lines.  Next,  we  examined  12 
pairs  of  HCRs  prepared  from  aneuploid  and  diploid  nuclei. 
LOH  at  this  locus  was  detected  clearly  in  9  of  10  informative 
pairs  (see  Fig.  5  for  representative  cases),  which  is  greater  than 
the  reported  proportion  of  LOH  at  this  locus  in  breast  cancer 
(50%)  (14,  18,  19)  but  may  reflect  the  selection  of  the  highly 
aneuploid  tumors  that  we  sorted  and/or  the  purity  of  our 
samples. 

Fig. 4. Quantitative PCR analysis of HCRs. Diploid (black) and aneuploid (gray) HCRs derived from sorted primary tumor biopsies were used
as template for quantitative PCR analysis. Probes from several genomic loci (FHIT, p16, and c-erbB2) were used to determine copy number in three
primary tumor pair HCRs (BBR44, BBR49, and CHTN5). The data from the Applied Biosystems 7700 Sequence Detector were analyzed with
Microsoft EXCEL to produce the graphs shown. (x axis, the cycle number during the reaction; y axis, the fluorescence detected.)

Fig. 5. Using HCRs for LOH analysis. LOH analysis was carried
out on HCRs derived from sorted primary tumor biopsies. Dpl, diploid
HCR; Anu, aneuploid HCR. The tumors are labeled as either BBR,
CHTN, or NSBR followed by an identification number. The primers
used in the reaction amplify a tetranucleotide repeat within the p53
locus. A mixed population of normal DNAs was used as a positive
control. (+Gen, normal genomic DNA used as template; +HCR,
HCR produced from this normal DNA used as template; −, reaction
in which no template was added.)

Comparative Genome Hybridization with HCRs. CGH is a
powerful tool for analyzing the global genomic changes of
tumors (20-25). It has been reported that chromosome painting
could be performed with representations produced from
each chromosome (26). We therefore tested whether CGH
could be performed with HCRs. For this experiment, we chose
to examine tumor cell lines so that we could directly compare
CGH performed with genomic DNA with CGH performed
with HCR. Little difference between HCR and genomic DNA
could be discerned with either of the two cell lines examined,
BT-474 and MCF7. Fig. 6 shows a sample of the chromosomal
scanning profiles obtained with each DNA source.

DISCUSSION 

The  idea  of  capturing  essential  features  of  the  genome  as  a 
PCR  product  is  not  new.  In  1989,  Kinzler  and  Vogelstein  (27) 
described  “whole  genome”  PCRs  to  select  for  DNA  sequences 
that  were  binding  sites  for  DNA  binding  proteins.  Their 
method  was  used  in  1995  (28)  for  the  same  purpose.  In  the 
experiments  from  1989,  PCR  adaptors  were  blunt-end  ligated 
to the cleavage fragments of total genomic DNA. The usefulness
of the resulting PCR products for genomic analysis was
not  explored;  nor  was  its  efficiency  explored  when  starting  with 
tiny  amounts  of  material.  An  alternate  approach  to  whole 
genome  PCR  is  to  use  random  priming  with  degenerate 
oligonucleotides  (29-33).  This  method  can  produce  large 
amounts  of  DNA  starting  from  very  minute  amounts  of 
sample, but the complexity of the DNA produced, the reproducibility
of the representation, and the loss of natural restriction
sites at the ends of the amplified material make the
usefulness of this method somewhat limited. Restriction
endonuclease-based representations have major advantages:
Their  complexity  can  be  regulated  by  the  choice  of  restriction 
enzyme;  they  can  be  readily  reamplified;  they  can  be  analyzed 
by  Southern  blotting;  and  they  are  highly  reproducible. 

In  this  report,  we  have  explored  the  uses  of  HCRs,  including 
quantitative  assessment  of  copy  number,  LOH,  and  CGH.  All 
of these analytical methods require a high level of reproducibility
in the representation. To achieve this level of reproducibility,
we have prepared paired samples of HCRs from the
same amount of starting material, used genomic DNAs extracted
in the same manner, and performed PCR at the same
time,  under  the  same  conditions,  and  in  the  same  thermal 
cycler. 

In  principle,  HCRs  can  be  prepared  from  normal  and  tumor 
tissue  stored  as  fixed,  paraffin-embedded,  archived  biopsies, 
which  would  extend  greatly  the  usefulness  of  such  samples.  We 
have made HCRs from DNAs extracted from pairs of aneuploid
and diploid nuclei sorted from such sources (R.L.,
unpublished  data).  More  rounds  of  PCR  are  required  to  obtain 
workable  amounts  of  DNA,  and  HCRs  from  DNA  extracted 
from  fixed  specimens  have  a  markedly  lower  size  distribution 
than  HCRs  prepared  from  fresh  sources.  Moreover,  there  is 
enormous variability between the HCRs from different specimens,
reflected as varying size distributions. This diversity is
probably  caused  by  the  variation  in  the  quality  of  the  DNA  that 
can  be  extracted  from  specimens  fixed  and  stored  under 
different conditions (19, 34-36). Despite the variability between
specimens, we found that the HCRs prepared from
aneuploid  and  diploid  nuclei  from  the  same  fixed  specimen  are 
similar  to  each  other.  Indeed,  we  have  found  that  HCRs 
prepared  from  sorted  nuclei  of  fixed  specimens  are  useful  for 
CGH  (unpublished  observations).  These  results  suggest  that 
normal  and  tumor  HCRs  will  be  useful  even  when  prepared  for 
analysis  from  microdissected  specimens. 

Both HCR and LCR render paired tumor and normal DNAs
from minute specimens in a stable format that can be analyzed
or further amplified at a later date. We have emphasized the
usefulness of HCRs for gene copy, LOH, and global genomic
analysis of tumor specimens. Most probes, chosen at random,
will be present in an HCR but will not be present in the LCRs.
However, precisely because LCRs are of lower complexity,
hybridization-based assays that depend on completeness of
hybridization are easier to perform. This ease is apparent in
blotting analysis of LCRs. For the same reason, some global
genomic analyses, such as microchip array analysis (37-40) or
CGH (20-25), that depend on hybridization kinetics should be
facilitated by the use of LCRs because reannealing times
should be reduced and signal-to-noise ratios enhanced.

Fig. 6. CGH analysis using HCRs. Shown are two representative chromosome spreads (Ch 1 and Ch 17) comparing the genomic (Gen) with
the HCR for two different cell lines, BT-474 and MCF7. Deviation from the dashed line represents genomic change; a peak represents gain in copy
number, and a trough represents loss of copy number. Underneath the profiles is an idiogram of each chromosome as a reference.

Abstract


Since  their  discovery,  protein  tyrosine  phosphatases  have  been 
speculated  to  play  a  role  in  tumor  suppression  due  to  their  ability  to  antagonize 
the  growth  promoting  protein  tyrosine  kinases.  Recently,  a  tumor  suppressor 
from  human  chromosome  10q23,  called  PTEN  or  MMAC1,  has  been  identified 
that  shares  homology  with  the  protein  tyrosine  phosphatase  family.  Germline 
mutations  in  PTEN  give  rise  to  several  related  neoplastic  disorders,  including 
Cowden  disease.  A  key  step  in  understanding  the  function  of  PTEN  as  a  tumor 
suppressor  is  to  identify  its  physiological  substrates.  Here  we  report  that  a 
missense  mutation  in  PTEN,  PTEN-G129E,  which  is  observed  in  two  Cowden 
disease  kindreds,  specifically  ablates  the  ability  of  PTEN  to  recognize  inositol 
phospholipids  as  a  substrate,  suggesting  that  loss  of  the  lipid  phosphatase 
activity  is  responsible  for  the  etiology  of  the  disease.  Furthermore,  expression 
of wild type or substrate trapping forms of PTEN in HEK293 cells altered the
levels of the phospholipid products of phosphatidylinositol 3-kinase, and
ectopic expression of the phosphatase in PTEN-deficient tumor cell lines
resulted in the inhibition of PKB/Akt and regulation of cell survival.

Introduction

Glioblastoma  is  one  of  the  most  common  and  malignant  forms  of 
cancer.  It  is  often  characterized  by  the  constitutive  activation  of  EGF-dependent 
signalling  pathways  due  to  the  amplification  of  members  of  the  EGF  receptor 
family  of  protein  tyrosine  kinases  (PTKs).  The  products  of  tumor  suppressor 
genes  may  attenuate  these  signalling  pathways  and,  therefore,  their  loss 
through  deletion  or  mutation  may  contribute  to  tumor  progression.  The 
frequent loss of heterozygosity at tumor suppressor loci is often used to identify
tumor  suppressor  genes.  The  10q23  region  of  human  chromosome  10  is 
frequently  deleted  or  mutated  in  a  wide  variety  of  tumor  types,  most  frequently 
in  glioblastoma,  endometrial  cancer  and  prostate  cancer,  indicating  the 
presence  of  a  tumor  suppressor  gene  at  this  locus.  PTEN  was  subsequently 
identified  as  this  tumor  suppressor  and  was  found  to  contain  the  catalytic 
signature  motif  detected  in  all  members  of  the  protein  tyrosine  phosphatase 
(PTP) family (1, 2). Importantly, PTEN appears to be lost frequently in advanced
cancers, suggesting that its deletion may not be the transforming event but that
PTEN may inhibit other cellular functions necessary for tumor progression (3, 4).

Importantly,  germline  transmission  of  mutations  in  PTEN  was  shown  to 
give  rise  to  a  related  set  of  disorders,  including  Cowden  disease,  that  are 
characterized  by  numerous  small  benign  tumors  and  an  increased  incidence  of 
other  malignant  growths  (5-7).  The  detection  of  germline  mutations  in  these 
neoplastic disorders verifies that PTEN is the tumor suppressor residing on
human chromosome 10q23. This conclusion is supported by the recent
findings  that  PTEN  knockout  mice  develop  tumors  similar  to  those  found  in 
Cowden  disease  (8). 

A  key  step  in  establishing  the  cellular  function  of  PTEN  is  the 
identification  of  its  physiological  substrates.  Recently,  two  groups  have 
identified different candidate substrates for PTEN. Tamura et al. report that
overexpression of PTEN results in changes in cell adhesion and spreading and
suggest that these effects result from the dephosphorylation of the protein
tyrosine kinase FAK (9). In contrast, Maehama and Dixon have shown that
PTEN recognizes phosphatidylinositol phosphate as a substrate (10). Our initial
studies demonstrating catalytic activity in PTEN indicated that it preferred
highly acidic or multiply phosphorylated substrates (11). This may explain
why, in certain contexts, PTEN will dephosphorylate poly-phosphorylated
molecules such as FAK or phosphatidylinositol phosphate (11, 12).
Nevertheless, the important question remains which multiply phosphorylated
molecules, protein or otherwise, are the physiologically relevant targets of
PTEN.

Significantly,  we  have  demonstrated  previously  that  the  majority  of 
missense  mutations  isolated  from  tumor  and  Cowden  disease  samples  ablate 
the  protein  phosphatase  activity  of  PTEN  (11).  One  exception  is  a  missense 
mutation  that  changes  a  glycine  residue  in  the  catalytic  signature  motif  to  a 
glutamate  (PTEN-G129E)  (11).  Although  there  was  no  adverse  effect  on  the 
protein phosphatase activity of PTEN, this G129E mutation was isolated from
two independent Cowden disease kindreds, indicating that it abolishes the
tumor  suppressor  activity  of  PTEN  (5,  12).  Therefore,  this  mutation  can  be  used 
as  an  important  indicator  to  determine  whether  a  proposed  function  of  PTEN 
is  specific  for  its  role  as  a  tumor  suppressor. 

We analyzed the PTEN-G129E Cowden disease mutation and found that
this mutation specifically inhibits the recognition of phosphatidylinositol
phosphates (PtdInsP) by PTEN. Additionally, we show that expression of PTEN
in mammalian cells results in changes in PtdIns(3,4,5)P3 levels and that expression
of PTEN in two independent glioblastoma cell lines results in the disruption of
signalling downstream of PI 3-kinase to PKB/Akt and BAD. Significantly, we
also show that expression of PTEN in LnCaP cells, a prostate tumor cell line,
abrogates cell survival and that this effect is inhibited by the expression of a
constitutively activated form of PKB. Therefore, we believe that the
physiological function of PTEN is to antagonize signalling downstream of
PI 3-kinase by dephosphorylating phosphatidylinositol phosphates.

Methods

PTEN Phosphatase Assays. Recombinant wild type and mutant forms of PTEN
were expressed in E. coli and purified by glutathione-affinity chromatography
(11). The purified proteins were assayed with polyGluTyr as described (11).
Release of 32Pi from radiolabeled phosphatidylinositol phosphates was
determined by performing a modified Bligh and Dyer extraction (13). The
upper phase (containing inorganic phosphate) was removed, dried down and
resuspended in a 1 M TCA, 1% ammonium molybdate solution. Following
extraction with 2 volumes of toluene:isobutyl alcohol (1:1), the upper phase
was removed and counted. Site selectivity was determined by incubating
recombinant PTEN or SHIP (a gift from C. Erneux, Free University, Brussels) with
radiolabeled PtdIns(3,4,5)P3, and the lipid products of these reactions were
analyzed by thin layer chromatography or HPLC (13). The phospholipids were
extracted by performing the modified Bligh and Dyer extraction described, and
the resulting lower phases were dried down, resuspended in 20 µl of
chloroform:methanol (2:1) and applied to an oxalate-activated silica 60 TLC
plate. Plates were developed in methanol/chloroform/water/ammonia
(100:75:25:15).

Determination of PtdIns(3,4,5)P3 levels. HEK293 cells were transfected via
calcium phosphate co-precipitation with 20 µg of DNA per 10 cm dish. The
calcium phosphate:DNA co-precipitate was removed by washing with PBS 16
hours after addition and the cells were returned to growth medium for 36
hours before harvesting. Co-transfections of PTEN and p110 constructs were
performed using 9.5 µg of each plasmid DNA. Transfection efficiency was
determined by including a GFP expression plasmid (1 µg) in all transfections
and was shown to be 80%. Phospholipids were extracted from the cells exactly
as described (14). PtdIns(3,4,5)P3 levels were assayed using a ligand
displacement assay (14) and were normalized to total protein.

Expression of PTEN in Glioblastoma Cell lines. PTEN retroviral expression
vectors were constructed in pBabePuro (15). Following transfection into
packaging lines, the viral supernatants were harvested, diluted with growth
media and incubated with U87MG or U373MG cells for 8 hours at 32°C in the
presence of polybrene. Infected cells were selected with puromycin (2 µg/ml)
and drug resistant colonies were expanded to generate clonal cell lines. Cell
lysates were prepared and immunoblots were performed essentially as described (16). Protein
levels were determined by the method of Bradford using BSA as a standard and
equal protein was loaded in each lane. Antibodies to PTEN were generated in
rabbits using a C-terminal peptide of PTEN (ENEPFDEDQHTQITKV)
conjugated to KLH. Antibodies to PKB/Akt, phospho-PKB/Akt (specific to
phosphorylated Ser473) and phospho-BAD were purchased from New England
Biolabs.

Expression of PTEN in LnCaP Cells. Cells were transfected using cationic lipids
(Transfast, Promega) at a DNA:lipid ratio of 1. Cells were incubated with the
DNA:lipid complexes for 12 hours and then transfected cells were identified
based on the expression of a co-transfected GFP vector or by performing
immunofluorescence using antibodies to the HA-epitope tag located at the
N-terminus of PTEN, essentially as described (17).

Results

Activity of PTEN towards inositol phospholipids. Recombinant PTEN,
produced in E. coli (11), was assayed for its ability to release 32Pi from
radiolabeled PtdIns(3,4,5)P3 or polyGluTyr (Figure 1A). As expected, wild type
PTEN catalyzed the dephosphorylation of both substrates (Figure 1A).
Remarkably, the activity of PTEN-G129E toward PtdIns(3,4,5)P3 was reduced by
~90% relative to the wild type enzyme, while activity towards
polyGluTyr was retained (Figure 1A). A catalytically-inactive mutant of PTEN (PTEN-C124S)
was unable to dephosphorylate polyGluTyr or PtdIns(3,4,5)P3, indicating that the
phosphatase activity was not due to a bacterial contaminant. In addition, cdc14,
a dual specificity phosphatase closely related to PTEN, was also unable to
dephosphorylate PtdIns(3,4,5)P3, demonstrating that recognition of this
phospholipid substrate is not a general property of other, even closely related,
dual specificity phosphatases. These data, demonstrating that the PTEN-G129E
mutation specifically ablates the activity of PTEN towards phosphorylated
inositol lipids, indicate that the lipid phosphatase, rather than the protein
phosphatase, activity of PTEN is required for it to function as a tumor suppressor
and that loss of the lipid phosphatase activity results in Cowden disease.

Site Specificity of PTEN within PtdIns(3,4,5)P3. In order to determine whether
PTEN recognizes specific sites in the inositol ring of PtdIns(3,4,5)P3, lipid
substrate, labeled exclusively in the 3-position with 32P, was incubated with
PTEN or SHIP, a well characterized 5-phosphatase (18). The products of these
reactions were analyzed by thin layer chromatography. Incubation with SHIP
yielded a radiolabelled product with the expected mobility of PtdIns(3,4)P2
(Figure 1B). However, no labeled lipid products were generated following
treatment with PTEN, under conditions where >50% of the 32P was lost from
the substrate (Figure 1B). The site selectivity of PTEN was confirmed by HPLC
analysis of similar reactions using [3H]- and [3-32P]-labeled Ins(1,3,4,5)P4 in which
the only products detected were 32Pi and [3H]-Ins(1,4,5)P3 (data not shown). In
addition to PtdIns(3,4,5)P3, PTEN also hydrolyzed the other potential products
of PI 3-kinase, PtdIns(3)P and PtdIns(3,4)P2, with the following rank order based
on relative percent hydrolysis: PtdIns(3,4,5)P3 = PtdIns(3,4)P2 > PtdIns(3)P >
Ins(1,3,4,5)P4. The specificity of PTEN for 3-phosphorylated inositol lipids
indicates that it may function as a negative regulator of PI 3-kinase-mediated
signalling and distinguishes PTEN from other known lipid phosphatases.

PTEN Inhibits PI 3-kinase-dependent Signaling. The potential for PTEN to
antagonize PI 3-kinase signalling was further investigated in mammalian cells
(Figure 2). Transient expression of PTEN in HEK293 cells lowered the levels of
PtdIns(3,4,5)P3 (Figure 2) (from 1 pmol/mg to 0.7 pmol/mg). Importantly,
PTEN-C124S, in which the catalytic cysteine has been mutated to serine,
resulted in an increase in the levels of PtdIns(3,4,5)P3 (Figure 2) (from 1
pmol/mg to 2 pmol/mg). The accumulation of PtdIns(3,4,5)P3 in response to
expression of PTEN-C124S is likely due to the ability of this form of PTEN to
behave as a "substrate trapping" mutant, resulting in a stable complex with the
lipid substrate which protects it from dephosphorylation by endogenous
phosphatases (19). The accumulation of PtdIns(3,4,5)P3 in the presence of a
substrate trapping mutant confirms that PtdIns(3,4,5)P3 is a physiological target
of PTEN (19, 20). Expression of a constitutively-activated PI 3-kinase
(p110-CAAX) resulted in elevated levels of PtdIns(3,4,5)P3 that were reduced ~60%
following co-expression with PTEN and were increased ~4.5-fold when
co-expressed with PTEN-C124S (Figure 2). The ability of PTEN to decrease
PtdIns(3,4,5)P3 levels produced by p110-CAAX further indicates that this
phosphatase exerts its effects on the products of PI 3-kinase rather than by
dephosphorylating the tyrosine residues responsible for recruiting the p85-p110
PI 3-kinase complex to the membrane.
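
The HEK293 measurements in this section are easier to compare when expressed as relative changes against the control level. The short Python sketch below uses only the three values stated in the text (1, 0.7 and 2 pmol/mg); the helper name is illustrative, and no values are assumed for the p110-CAAX experiments because the text gives only their relative changes.

```python
def relative_change(control: float, treated: float) -> float:
    """Fractional change of a treated level relative to its control level."""
    if control <= 0:
        raise ValueError("control level must be positive")
    return (treated - control) / control


control_level = 1.0  # pmol PtdIns(3,4,5)P3 per mg protein, from the text

wild_type = relative_change(control_level, 0.7)  # PTEN wild type
c124s = relative_change(control_level, 2.0)      # PTEN-C124S

print(f"PTEN wild type: {wild_type:+.0%}")  # -30%
print(f"PTEN-C124S:     {c124s:+.0%}")      # +100%
```

Expressed this way, wild type PTEN lowers the basal level by about a third, whereas the substrate trapping mutant doubles it.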

Importantly, we detected defects in PI 3-kinase signalling in tumor cell
lines that have lost their endogenous PTEN genes due to deletion or mutation.
Signalling downstream of PI 3-kinase was assessed in two of these
PTEN-deficient cell lines, U87 MG and U373 MG, by determining the phosphorylation
status of PKB/Akt, a kinase whose activation is dependent on PtdIns(3,4,5)P3 or
PtdIns(3,4)P2, two products of PI 3-kinase (21). Clonal cell lines were generated
in U87 MG and U373 MG in which PTEN expression was restored by infection
with recombinant retroviruses (Figure 3). The parental cell lines exhibited
high basal levels of phosphorylated PKB/Akt, whereas in every wild type
PTEN-expressing clonal cell line the basal levels of activated PKB/Akt were
significantly lowered (Figure 3). Expression of a catalytically-inactive mutant of
PTEN, PTEN-R130M, had no effect on basal PKB/Akt activation (Figure 3).
Similarly, no effect on PKB/Akt activation was observed in a U373 MG
(PTEN-SRVE #B9) clone infected with an unstable mutant of PTEN (Figure 3).
Following reconstitution of PTEN expression in the tumor lines, activation of
PKB/Akt by insulin or PDGF was unaffected (Figure 3), suggesting that there
are regulatory mechanisms which allow for the generation of PtdIns(3,4,5)P3
following growth factor stimulation. The activation of PKB/Akt was
completely inhibited by wortmannin in the tumor cell lines (Figure 3),
indicating that the increase in PKB/Akt phosphorylation was PI 3-kinase-dependent.

Regulation  of  Cell  Survival.  One  of  the  downstream  substrates  of  PKB/Akt  is 
the  death  effector  protein  BAD  (22),  phosphorylation  of  which  has  been  linked 
to  the  promotion  of  cell  survival  (23).  Although  expression  of  PTEN  in  both 
U87  MG  and  U373  MG  resulted  in  the  reduction  of  BAD  phosphorylation 
(Figure  3C),  we  did  not  detect  evidence  of  apoptosis  in  the  PTEN-expressing 
clones. However, expression of wild type PTEN in LnCaP cells, a PTEN-deficient
prostate cancer cell line, resulted in a decrease in the number of PTEN
positive cells recovered relative to controls expressing the catalytically-inactive
mutant PTEN-C124S (Figure 4). In this assay, the Cowden disease mutation
PTEN-G129E behaved similarly to PTEN-C124S, consistent with the ablation of
the tumor suppressor activity of PTEN by this mutation (Figure 4).
Significantly, coexpression of a constitutively-active, membrane-targeted
PKB/Akt (24) completely reverted the PTEN phenotype, in that PTEN positive
cells could now be recovered despite the presence of wild type PTEN (Figure 4).

Discussion

PTEN is a newly defined tumor suppressor gene that has been implicated
in a wide variety of cancers and in a series of related disorders that are
characterized by a predisposition to cancer (1, 2, 25-28). Although these studies
have offered many insights into the biology of PTEN, they do not address the
molecular mechanism of PTEN function, which requires the identification of
its physiological substrates. However, this has become an area of controversy.
Tamura et al. have suggested that PTEN disrupts cell spreading and migration
by dephosphorylating FAK (9). In contrast, Maehama and Dixon have shown
that, in certain contexts, PTEN can dephosphorylate the lipid second messenger
phosphatidylinositol phosphate (10). However, both of these studies fall short
of demonstrating that the ability of PTEN to dephosphorylate FAK or
phosphatidylinositol phosphate is related to its tumor suppressive function.
Because previous studies have shown that PTEN prefers highly
acidic or multiply phosphorylated substrates (11) and FAK and
phosphatidylinositol phosphate share these properties, one must view these
results with caution, as they may result from non-specific interactions between
PTEN and highly charged molecules (12).

Significantly, we have been able to show that a single point mutation in
PTEN, PTEN-G129E, specifically ablates the ability of PTEN to recognize
phospholipids as substrates. In contrast, Tamura et al. found that the
PTEN-G129E point mutation behaves essentially like wild type PTEN in their assays of
cell spreading (9). Even though this mutation does not alter the ability of PTEN
to recognize proteinaceous substrates, the fact that it has been identified in two
independent Cowden disease kindreds indicates that this mutation would be
expected to impair the tumor suppressor activity of PTEN (5, 11). Therefore,
these findings demonstrate that the lipid phosphatase activity rather than the
protein phosphatase activity of PTEN is important for its tumor suppressive
activity and that FAK, at least in this respect, is not a relevant target of the
phosphatase.

Characterization  of  the  lipid  phosphatase  activity  of  PTEN  demonstrated 
that  it  shows  specificity  for  phosphatidylinositols  phosphorylated  at  the  3- 
position.  This  represents  the  first  phosphatidylinositol  phosphatase  with 
specificity  for  the  lipid  products  of  PI  3-kinase  and  suggests  that  PTEN  may 
function  to  antagonize  the  growth  promoting  signals  generated  by  PI  3-kinase. 
Indeed,  overexpression  of  PTEN  in  mammalian  cells  disrupted  the  PI  3-kinase- 
dependent  production  of  PtdIns(3,4,5)P3.  Furthermore,  expression  of  a 
catalytically-inactive  mutant  of  PTEN,  PTEN-C124S,  which  may  function  as  a 
"substrate  trap",  results  in  the  accumulation  of  PtdIns(3,4,5)P3,  indicating  that 
PTEN  may  function  in  vivo  to  antagonize  PI  3-kinase-dependent  signalling. 

Importantly,  signalling  downstream  of  PI  3-kinase  is  altered  in  two 
different  glioblastoma  cell  lines,  U87  MG  and  U373  MG,  which  have  lost 
expression  of  PTEN  due  to  mutation  or  deletion  of  both  alleles.  Both  cell  lines 
exhibited elevated levels of active, phosphorylated PKB/Akt, a
serine/threonine kinase whose activation requires the production of
PtdIns(3,4,5)P3 or PtdIns(3,4)P2 (21, 29). Restoration of PTEN expression by
retroviral transduction resulted in the reduction of activated PKB/Akt, as is
evident  from  the  observed  reduction  in  the  phosphorylation  of  PKB/Akt  and 
in  reduced  levels  of  phosphorylated  BAD,  an  apoptosis  regulator  which  is  a 
known  in  vivo  substrate  of  PKB/Akt  (22).  Significantly,  insulin  or  PDGF 
stimulation  of  these  cell  lines  resulted  in  the  stimulation  of  PKB/Akt 
regardless  of  whether  PTEN  expression  had  been  reconstituted,  suggesting  that 
there  are  mechanisms  in  place  which,  in  response  to  growth  factor  stimulation, 
serve  to  regulate  the  activity  of  PTEN  and  allow  the  production  of 
PtdIns(3,4,5)P3.  This  is  especially  important  given  that  PTEN  is  widely
expressed,  including  in  cell  lines  which  produce  PtdIns(3,4,5)P3  in  response  to
insulin.

Although  the  reconstitution  of  PTEN  decreased  the  levels  of 
phosphorylated  BAD,  we  were  unable  to  detect  evidence  of  apoptosis  in  these 
cell  lines.  PKB/Akt  is  thought  to  affect  cell  survival  in  many  cell  types  by
phosphorylating  BAD  or  inactivating  GSK3β  (23,  33).  Reconstitution  of  PTEN
in  LNCaP  cells,  a  PTEN-deficient  prostate  cancer  cell  line,  resulted  in  the
inhibition  of  cell  survival.  Importantly,  expression  of  the  PTEN-G129E 
Cowden  disease  point  mutation  does  not  result  in  the  abrogation  of  cell 
survival.  This  indicates  that  the  lipid  phosphatase  activity  of  PTEN  is  required 
for  these  effects  on  cell  survival.  The  properties  of  the  PTEN-G129E  mutant 
emphasize  the  requirement  of  the  lipid  phosphatase  activity  for  the  tumor 
suppressor  function  of  PTEN.  Furthermore,  the  PTEN-dependent  induction  of 
apoptosis  was  completely  inhibited  by  co-expressing  a  constitutively-active,
membrane-targeted  form  of  PKB/Akt,  indicating  that  PTEN,  by
dephosphorylating  phosphatidylinositides,  is  an  upstream  regulator  of
PKB/Akt  and,  at  least  in  LNCaP  cells,  serves  to  regulate  survival  signals.
Thus,  in  some  cell  types,  PTEN  functions  as  a  tumor  suppressor  to  inhibit  the 
PKB/AKT-dependent  survival  signals  that  are  activated  in  response  to  3- 
phosphorylated  phosphatidylinositols. 

The  failure  of  PTEN  to  regulate  cell  survival  in  the  glioblastoma  cell 
lines  suggests  that  the  generation  of  phosphatidylinositol  3-phosphates  plays 
other  roles  during  tumor  progression.  In  fact,  PI  3-kinase  has  been  implicated 
in  the  regulation  of  a  wide  variety  of  cellular  processes  (30).  Therefore,  by 
dephosphorylating  the  products  of  PI  3-kinase,  PTEN  is  likely  to  be  involved  in 
the  regulation  of  many  of  these  processes.  Significantly,  both  PI  3-kinase  and 
PKB/Akt  have  been  isolated  as  transforming  oncogenes  in  retroviruses  (31,  32). 
These  observations  indicate  that  PI  3-kinase  and  PKB/Akt  function  in  a 
signalling  pathway  promoting  growth  and  survival.  Our  data  demonstrate  that 
PTEN  functions  to  suppress  these  growth  promoting  and  survival  signals  by 
dephosphorylating  the  phospholipid  products  of  PI  3-kinase.  Our  data  provide 
a  molecular  basis  for  the  development  of  Cowden  disease,  as  loss  of  the  lipid 
phosphatase  activity  of  PTEN  leads  to  the  accumulation  of  the  products  of  PI  3- 
kinase  and  results  in  the  neoplastic  pathologies  characteristic  of  the  disease. 

Figure  Legends

Figure  1:  PTEN  is  a  phosphatidylinositol  3-phosphatase.  A)  Recombinant
PTEN  was  incubated  with  radiolabeled  polyGluTyr  and  PtdIns(3,4,5)P3  and  the
release  of  32Pi  was  measured  as  described.  B)  PtdIns(3,4,5)P3  labeled  at  the
3-position  was  incubated  with  PTEN  or  SHIP  for  0  or  60  minutes  and  the
reaction  products  were  resolved  on  TLC  plates  and  visualized  by 
autoradiography. 

Figure  2:  Expression  of  PTEN  antagonized  PI  3-kinase.  HEK293  cells  were 
transfected  with  PTEN  or  PTENC124S  in  combination  with  the  p110  subunit  of
PI  3-kinase  or  an  activated,  membrane-bound  PI  3-kinase  (p110-CAAX).  The
resulting  levels  of  PtdIns(3,4,5)P3  were  determined  as  described  and  were 
normalized  to  total  protein.  Data  are  expressed  as  pmol  PtdIns(3,4,5)P3  per  mg 
protein.  These  data,  from  a  single  experiment,  are  representative  of  three 
experiments  which  yielded  similar  results. 
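
The normalization described in this legend can be illustrated with a minimal sketch. The function name and the example values below are hypothetical and are not taken from the experiments; the sketch only shows how a lipid measurement in pmol might be expressed per mg of total protein.

```python
def normalize_to_protein(pip3_pmol, protein_mg):
    """Return PtdIns(3,4,5)P3 expressed as pmol per mg of total protein."""
    if protein_mg <= 0:
        raise ValueError("protein_mg must be positive")
    return pip3_pmol / protein_mg


# Hypothetical lysate: 12.0 pmol of lipid measured in 0.5 mg of total protein.
level = normalize_to_protein(12.0, 0.5)
print(f"{level:.1f} pmol PtdIns(3,4,5)P3 per mg protein")
```

Normalizing to total protein lets samples with different cell numbers be compared on the same scale.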

Figure  3:  Expression  of  PTEN  in  glioblastoma  cell  lines  decreases  the  amount 
of  activated  PKB/AKT.  PTEN  expression  was  reconstituted  in  A)  U87MG  or  B) 
U373MG  glioblastoma  cell  lines  by  infecting  with  recombinant  retrovirus.
Confluent  dishes  were  either  left  untreated,  stimulated  with  insulin  (10  µg/ml
for  10  minutes)  or  PDGF  (50  ng/ml  for  10  minutes),  or  pretreated  with
Wortmannin  (150  nM  for  30  minutes)  and  then  stimulated  with  both  insulin
(10  µg/ml)  and  PDGF  (50  ng/ml)  for  10  minutes,  and  then  lysed.  Equal
amounts  of  protein  were  loaded  for  each  cell  line  and  immunoblots  were
probed  with  antibodies  to  PTEN,  PKB/Akt  (PKB),  phospho-PKB/Akt  (P-PKB),
or  phosphorylated  BAD  (P-BAD)  and  visualized  using  ECL  reagents. 

Figure  4:  Expression  of  PTEN  inhibits  cell  survival  in  LNCaP  cells.  PTEN,
PTENC124S  or  PTENG129E  were  co-transfected  with  a  green  fluorescent
protein  (GFP)  expression  vector  into  LNCaP  cells.  The  co-transfections  also
included  either  an  expression  vector  for  constitutively  active  PKB/Akt  (black
bars)  or  a  control  empty  vector  (striped  bars).  Transfected  cells  were  identified
as  GFP  positive  cells  or  by  immunofluorescence  microscopy  following  staining 
with  antibodies  to  the  transfected  PTEN.  Transfection  efficiency  was  assessed  by 
determining  the  percentage  of  GFP/PTEN  positive  cells  (total  number  of  cells 
was  determined  by  counting  nuclei  (stained  with  DAPI)).  Data  are  expressed  as 
the  mean  transfection  efficiency  (+/-  s.d.  n=3). 
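
The quantity reported in Figure 4 can be sketched as follows. This is only an illustration: the counts are hypothetical, the function names are introduced here for the example, and the use of the sample standard deviation (n-1 denominator) is an assumption, since the legend does not state which form of s.d. was used.

```python
from statistics import mean, stdev


def transfection_efficiency(positive_cells, total_nuclei):
    """Percentage of DAPI-counted nuclei belonging to GFP/PTEN-positive cells."""
    if total_nuclei <= 0:
        raise ValueError("total_nuclei must be positive")
    return 100.0 * positive_cells / total_nuclei


def summarize(counts):
    """Return (mean, sample s.d.) of the efficiency across replicate counts."""
    values = [transfection_efficiency(p, t) for p, t in counts]
    return mean(values), stdev(values)  # stdev uses n-1 (assumed, see above)


# Hypothetical (positive cells, total nuclei) counts for n=3 replicates.
example_counts = [(30, 200), (36, 200), (24, 200)]
avg, sd = summarize(example_counts)
print(f"transfection efficiency: {avg:.1f} +/- {sd:.1f} % (n={len(example_counts)})")
```

Because survival is read out as the fraction of transfected cells remaining, a lower percentage for a given construct corresponds to fewer surviving PTEN-expressing cells.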